8.06: Quantum Physics II 


Lecturer: Professor Max Metlitski 
Notes by: Andrew Lin 


Spring 2021 
Recitations for this class are being taught by Professor lain Stewart. 


Introduction 


Professor Metlitski works in the Condensed Matter Theory group, working on problems about quantum matter at low 
temperatures. So quantum physics will hopefully become “close to our heart” too! 

The formal prerequisite for this class is a grade of C or higher in 8.05, but 8.06 is more challenging than 8.05, 
both conceptually and technically. Lots of stuff is packed into the class, so if we’re unsure about our background, we 
should talk to the course staff. 

Lectures, recitations, and office hours are all remote (and lectures will be recorded) — materials and problem sets 
will be on Canvas, and there is also a Piazza site linked to Canvas. We're encouraged to ask questions on Piazza (and 
to answer each others’ questions as well). 

Professor Barton Zwiebach’s lecture notes are the main course reading for this class (soon to be a textbook). 
Chapter 1 is on Canvas, and for each week, the problem set will indicate what chapter to read. (The syllabus also 
contains a list of other good textbooks, but Professor Zwiebach’s notes are the main source to consult. In particular, 
the level of this course is a bit higher than Griffiths.) 

Grading will be flexible in that our grade will be maximized among the following windows: 8 problem sets (with 1 
dropped) are worth between 25-40%, the term paper will be worth between 25-35%, and the final exam will be worth 
between 30-50%. 

The term paper has historically been a fun part of the class — details about this are on Canvas in a separate 
document, which we should read carefully, but basically it can be about any topic in quantum physics that hasn't been 
covered in 8.04, 8.05, or 8.06. The main relevant deadlines are that a proposal is due on March 26 (which requires 
our topic, references, and a peer-editor in the class). After that, a first draft is due on April 21, and the final version 


is due on May 6. (Since this is a CI-M, we need a C or higher on the paper to pass the class.) 


1 February 16, 2021 


In 8.05, a lot of the formalism of quantum mechanics was developed, and now we're ready to apply this formalism to 
real problems and phenomena in quantum physics (the fine structure of hydrogen, decay of excited states, tunneling, 
motion of charged particles in a magnetic field, and so on). And these problems basically exemplify the early successes 


of quantum mechanics, so they’re very important, and they illustrate a lot of physical concepts. 


To approach these problems, we'll need some new tools, and these tools are often certain approximations. In 
almost any branch of physics, we'll see these tools being used, so it's good to get them into our toolbox! 

We'll start with time-independent perturbation theory. |n 8.05 (and also in 8.04), we saw certain physical 
systems, like the harmonic oscillator and the hydrogen atom, and we solved these systems exactly (finding the energies 
and eigenstates of the Hamiltonian). We cannot find the exact solutions to most problems in physics, but often we 


can use the exactly solvable systems as a starting point: 


Example 1 


A diatomic molecule consists of two atoms separated by some distance r, interacting via a potential V(r) (with 


a minimum at some equilibrium distance r = 1). 


Any function near its minimum can be approximated by a parabola, so for oscillations close to the equilibrium 
separation, we can think of this system as (approximately) a simple harmonic oscillator. And at this zeroth order 
approximation, we do indeed know the energies and eigenstates. But then the anharmonicities turn on as we move 


away from equilibrium, and we're going to learn how to take these (small) deviations into account. 


Example 2 


A hydrogen atom sitting in vacuum can be exactly solved, but applying a small magnetic or electric field to the 


atom will shift the spectral lines (energy levels). 


Again, these problems can be solved by starting with the exactly solvable system, and we need to adjust the 
Hamiltonian a little bit to get to our actual system. 


With that, let’s start with the formalism. We can write our Hamiltonian as 
H = H + 65H, 


where H is our exactly solvable system and 6H is a small perturbation. (Note that H© and H are Hermitian 
because they are both Hamiltonians, so their difference 6H must also be Hermitian.) We'll also assume here that all 
of these terms are time-independent (we'll study time-dependence later in the course). 

Finding the energies and eigenstates of H is hard in general, but we will make an assumption that 6H is small 
compared to H(). It’s not entirely clear what this means yet — they are operators, not numbers — but we'll develop 


some criteria for this over time. And this motivates the modified problem where we write 
H(X) = H +. - 5H, 


where is a dimensionless number in the range [0, 1]. In other words, we have a family of perturbations H(A), where 
H(0) = H©) is our starting Hamiltonian, H(1) = H© + 6H is the Hamiltonian that we want to study, and we can 
interpolate between the two. But this A is also useful as an expansion parameter: even if 6H is not that small, we 
can make A small enough so that AdH is small. So expanding in A (as a series) is basically like expanding in 6H. 
Before we do any calculations, we can try visualizing what might happen to our energies and eigenstates as A goes 
from 0 to 1. At X) = 0, we know the energy spectrum exactly: for example, we might have a single (nondegenerate) 
ground state at some lowest energy, and then a two-fold degeneracy at the first excited state, followed by another 
nondegenerate state, and finally a three-fold degeneracy at the highest energy. If we “turn on” a finite A, we can track 
what happens to these energies. They should evolve continuously with A, so the ground state energy might wiggle 


a little bit in a continuous way (and so will the other nondegenerate state). But there are two states with the same 


energy at the first excited level, and those can either remain degenerate for all A, or (more commonly) they will split 
apart at positive A. (So understanding how this splitting takes place will be very important as we go on.) And this 
splitting of states can occur at different orders as well: perhaps in the three-fold degeneracy, two of the states evolve 
with the same “starting slope,” but not the third. So these are just some potential illustrations of the evolution, and 
the point is that there are a lot of options. 

We'll understand how to deal with nondegenerate states’ energy evolution through nondegenerate perturbation 


theory, and we'll address the energy splitting through degenerate perturbation theory (which is a bit more subtle). 


Remark 3. /n principle, for large enough 2 or if 6H is not small enough, the energies may cross. But we usually try 


to avoid these (for instance, by making X small), and we'll discuss this point later on as well. 


Let’s now begin to study nondegenerate perturbation theory. As before, we have a Hamiltonian H© with certain 
eigenstates | k() (labeled by an integer k > 0), such that 


HO) K) =m JA) 
for corresponding energies EO), We can choose these eigenstates to be orthonormal, so that 


(00/4 = be, 


and we'll order these states in nondecreasing energy, so that E) < EO) < E) <.-+. (Equalities are possible because 
of degeneracies.) If we focus on a level that is initially nondegenerate, that means we're focusing on some level | 119) 
so that EO, < EM < EO (we now have strict inequalities), but no assumptions are being made on degeneracies 
outside of level |n{)). 


So now we want to solve the eigenvalue problem with the full Hamiltonian 
H(A) |), = En(A) |n), - 


We know that the answer reduces to the unperturbed energy and eigenstate at A = 0, so F,(A = 0) = EO), and 
IN),-9 = [n). Our key assumption now is that the energy and state can both be expanded in a power series in 
A, so that 

E, (A) = EF +r60) + 2EO 4... , 


and similarly 
In), = n°?) +2 Jn“) +)? |n')) ap etig 


Here, the subscripts indicate the level, and the superscripts (0), (1),--- indicate the order. (If we're unsure how to 
“expand” a vector like |n),, we can think about doing so component-by-component.) So our goal is to find these 
expansion coefficients ey Ee) -»+ and |n)) , |n'°)) ,-++ (and note that these coefficients are all A-independent). In 
practice, then, we can compute the first few terms and truncate the power series expansion, and we hope (or check) 


that this is a good approximation. 


Fact 4 


We can always multiply the eigenstates by a constant, so we will not assume that |n), is normalized here (and we 


can normalize them at the end if we want). 


Our first step is to plug the ansatz for |n), into our eigenvalue equation, which we first rearrange as 


(Ho + 5H — E,(X)) In), =0. 


Plugging in our power series expansions, the term in parentheses becomes 
(H — EO) — x (Ee = 6H) — 262) —...— KEW _... , 
where we've grouped terms by 4, and it’s acting on the state 
}n'?) +|n() ) 432 Jn’) ) fe 


This looks pretty messy, but It isn't actually that bad — we solve this equation by separating things order-by-order in 


d. (After all, we have an infinite polynomial in 0, so we're trying to make each term of that infinite polynomial 0.) 


+ The only way to get a A° is to have no A terms at all: 


(Ho 2 E0)) n°?) =55)/), 


£, 


+ We can get a first-order > by taking one from the blue term or from the red term. This gives us (moving one 


of the terms to the other side) 


(#0 =) fo) = (e)—9) )] 


+ Similarly, there are three ways to get a second-order X: (0, 2), (1,1), (2,0). We move the last two possibilities 
to the right side to get 


(© — £0) [n®) = (ER — 6H) |r) +E [n) | 


+ The pattern is now a bit more clear: if we want the Ath order in A, the equation looks like 


(HO) — E00) |nt) = (ED — GH) [alD) + £2 |alh2) 4... + E00 |r) | 


Studying the structure here, the zeroth order equation gives us nothing new -— it’s dealing with the unperturbed 
state and energy, so it’s automatically satisfied. The first-order equation has two unknowns, EY) and |n™), which 
we can solve for. Once we do that, the only unknowns left in the second-order equation are E() and |n?), and we 
can keep doing this process recursively (order-by-order in X), so that we always have one unknown wavefunction and 


one unknown energy when solving each equation. 


Proposition 5 


We can choose our | n“*) vectors so that they are all orthogonal to |n). 


Basically, we can always take the piece of the vector proportional to |n and put it into | instead, and there 
are no normalization problems by our assumption. The argument is spelled out more carefully in Professor Zwiebach’'s 


notes, but for illustration, let’s write it out for the At equation, which is (copying from above) 


(1-9) 0) = (E21) 1). 


Suppose that (n| n is nonzero. Then we can find a new solution 


HY =o) +f 


which is also a solution to the equation, and we pick c so that (nn! = 0 (explicitly, we pick c = — (n |n)). 
And this kind of process can be done for higher-order terms as well. 

So now let's solve the A? equation: both sides of the equation are vectors, and thus we can check equality by 
taking inner products with the orthonormal basis | (for all nonnegative integers k). First, let's act with (n | on 


both sides, which gives us 
(n()] 4 = £0 | nf) = (nl EQ) = bH\n') . 


which is an equality of numbers. But notice that the left-hand side is zero here, because 
(H — 6) n°?) = 
and then we can take the adjoint of both sides, noting that our operators are Hermitian, to get that 
Co) (HO — £) =o, 
So looking at the right-hand side, we have 
= (nf 0) = 6H] n ) = 6 (n(n = (n(]54|n ) . 
which means that (because |n is taken to be normalized) 
£0) = (1 or]. 


In other words, we've found the energy correction (without needing to find the state correction)! And now let's 


generalize this to the kth order equation, which (again copying from above) is 
(Ho = E()) jn = (eo = 5H) ns) ) 4+ £2) jatk2) rae ar =) nfo?" 


If we act with the bra (n(| again, the left-hand side is again zero, and orthogonality means that everything except 


the first and last term on the right-hand side become zero! So we're left with 
0= (nf EM) — 5H] nD ) + Et) 


whenever k > 1, and we can even take out the EO) term in the first term because of orthogonality again. So that 


leaves us with our final result: 


Proposition 6 


The higher-order corrections to the energy are given by 


£09 = (10 or|n0), 


Notice that this requires us to have the (k — 1)th-order state correction to find the kth-order energy correction. 


So we'll start next lecture by understanding how to find the eigenstate correction [Jn 1 


Remark 7. Analyticity in our Hamiltonians is okay to assume when we have finite-dimensional Hilbert spaces, but if 
we have infinite-dimensional Hilbert spaces, It depends on how our operators act — the resulting power series may not 
converge. Such asymptotic series can still be useful, though — we just need to sum the right number of terms and 


make sure we don't add up too many of them! But we won't worry too much about that in this course. 


2 February 17, 2021 (Recitation) 


Professor Stewart's been at MIT for 17 years and is currently the director of the Center for Theoretical Physics, 


working on nuclear and particle physics (particularly quantum field theory). 


Fact 8 


Everyone in the class introduced themselves (name, location/time zone, year, what we did over IAP, and what we 


like doing outside of physics). 


Some recitation materials have been posted on Canvas, so we can always scroll back to see what we talked about 
recently (as long as we refresh our browser). We can also use gather.town as a proximity video chat app: we can talk 
to just the people who are near us, see a shared whiteboard, and generally have the room serve as a “virtual classroom.” 
So the idea is to work on our problem sets in here as a replacement to the physics common room, and it'll also be the 


space for office hours. 


Fact 9 
The office hours for this class are tentatively Tuesday 5-6 (Cagin), Wednesday 1-2 (Professor Stewart), Thursday 


11-12 (Professor Metlitski), and Friday 2-3 (Cagin). 


The point of recitations is not to teach much new formalism — instead, we'll review some concepts or do practice 
problems each time. But if we like or don’t like something in particular, we should let Professor Stewart know. 

We're doing perturbation theory right now, so we'll start by emphasizing the importance of perturbation theory ideas 
in physics. Basically any problem that we solve requires us to make some approximation (the system is nonrelativistic, 
or there are a certain discrete set of states we're considering). In 8.05, we often include these as part of the problem 
statement, but in 8.06 we'll be a bit more careful about what approximations we're making and what we can neglect. 

Our first step is time-independent perturbation theory, and in lecture we started by only considering nondegenerate 


states (meaning that the energy levels are different). To review, we wrote our Hamiltonian as 
H = HO + 6H, 


and our goal is to solve the H eigenvalue problem H|n) = E,|n) given the answer to the H() eigenvalue problem 
H() | n) = Eo) [n). The idea is to use the A as a parameter to expand in a power series: we write the perturbed 


energies and eigenstates as 
|n) = [n‘o>) + Jn) +)? a‘) foes, 


E, = EO +E 4 2EO) +... 


(Some textbooks may use slightly different conventions about A, so we should be aware of that.) A derivation done 


from lecture, assuming that | n*) (for positive k) are all orthogonal to [n©), led us to the result 
EO) = (n!|644|n) 


as well as the more general answer 
EO = (| 5H] 0-2), 


So once we figure out how to find | n‘*) (next lecture), we'll be able to find the coefficients of our power series. (The 
[x) (KO [6H |n°°)) 

n EO_ EO 

One of our questions to ask is “why can we do this expansion” in a specific problem, and in particular, what kinds 


first answer will turn out to be |n)) = ae , but we won't use that anywhere today.) 


of things can happen if the expansion is not okay. 


Example 10 


If something non-physical is being predicted (like an un-normalizable state, or a probability that’s less than O or 


more than 1), then either we did an incorrect calculation or perturbation theory doesn’t apply. 


Example 11 
There's also certain effects that are missed by perturbation theory — for example, if we approximate a double-hump 
potential by a quadratic function, we ignore the possibility of tunneling outside the humps. So if we care about 


that kind of situation, perturbation theory might not be the right tool to use. 


Example 12 
If we find that ie is on the same order of magnitude as ew then our perturbation isn’t a good approximation, 
either. Similarly (but perhaps less intuitively), if the gap between the energy levels E©) — EY is on the same 


order as our perturbation KES, then we might also run into problems. 


To illustrate how some of these situations might come up, let’s do a particular calculation: 


Example 13 


Consider two levels in a physical system, forming a two-state system. Let's try to analyze the question of “potential 


crossings’ of energy levels more carefully. 


The unperturbed Hamiltonian for this system is then 


(0) 
140) — F5 0 

¢ £2” 
and EO), Suppose that our perturbation looks something like 
Wy Wo : 
Wo Wi] 
let's try to understand how to solve this system exactly. The idea is that we'll switch to Pauli matrices: the diagonal 
terms of the perturbed Hamiltonian H are EO + dw, and £0) —w1, and we want to write these as E—A and E+A 


5 : (0) 
(if the starting energy levels are E>} 


AOH = X 


(just by making a change of variables). Once we do that, the Hamiltonian can be written as 


_ E-A AW 
AW E+A 


The exact eigenvalues here are 


A=EtSA2+ (Awo)?, 


and on problem 1 of our problem set, we'll see that we can get the first-order term here (after expanding the square 
root) using perturbation theory. But returning to the motivating question: A is the energy splitting, and Awp is the 
scale of the perturbation, so this is an example where we need the energy difference to be much larger than the 
perturbation for the square-root expansion to be valid. 

So looking at the form of the perturbed energies, we won’t have any energy crossings unless the entire square root 
term Is zero, meaning Wo = A= 0. And tuning two things to zero is very difficult, so we should not expect crossings 


in the generic case. 


3 February 18, 2021 


First, a quick review: last time, we started the study of non-degenerate perturbation theory, and the setup is that we 
have a perturbed Hamiltonian H(A) = H©) + 6H (where the second term is small compared to the first). When 
trying to solve this H(A) |n), = En(A)|n),, we expanded a power series in terms of our parameter 2, and last time, 
we found the expressions for the energy coefficients EW) E(), -++ by plugging our ansatz back into the Schrodinger 


equation and collected order-by-order terms, which we could solve recursively. The first result we found was that 
£0) = (nar), 


and we arrived at this by basically “looking at the | n° )-components” by putting a (n)| on both sides of the order- 


by-order vector equations. So if we instead try looking at components along (kO], we find that 
(«O/H ny Eni) = (i) 20) < bH|n) 


On the left-hand side, if we have the operator act on the bra instead of the ket, we get the expression (E _ 
£0) (kO|n), Meanwhile, on the right-hand side, the term («()| £0 |n( ) is zero by orthogonality, so we end up 


with 
(k|5H|n) 


EO) _ EY 


(EO — £) (K/n®) = (K]5H|n ) ete (9/0) —_ 
and since the numerator is a matrix element, we'll introduce the notation 
bie = (K]54|n ) 


and thus we have 


5H 
(0)) 2), = OF _ 
(k n )= EO — EO 


This is actually enough to tell us the first-order correction |n): because our initial eigenstates form a complete 


orthonormal basis, we can write the resolution of the identity 1 = }7, | «0 (k©| 
Jn ) = - KO) («)|n ) 
k 


When k = n, we have (n | n@ = 0 by our “useful choice” from last lecture! So we can just restrict this sum to 
terms k £ n, and thus we find that 


J) = oor lM) 


(0) 
kén &k En 


Basically, if we want to calculate the first-order correction to the state |n), we get potential contributions from all 


unperturbed states |k()). 


Remark 14. This is the first time we’re encountering an energy difference in the denominator, but it'll come up much 
more in perturbation theory. And notice that this is the first time that we need the nondegeneracy (otherwise, it’s 


possible that we'll have the denominator EO) — E) being zero!) 


Fact 15 


For the purpose of this course, the perturbations aren’t going to introduce new degrees of freedom, because H(° 


and 6H live in the same Hilbert space. So the resolution of the identity is indeed valid. 


Last time, we also found the Ath-order correction to the energy 
0 = (0644) n), 


So if we plug in 2 = 2 and our expression above for [n@), we find that 


6H 
E0? = (not) = — > SH (non | 4), 
k#én Ek n 


which further simplifies by our definition to 


OAK nd Ank 

E? =—-) 
0 0) 
ree Ee 


But because 6H is a Hermitian operator, (k|5H|n@) = (nO |5H| KO)”, so 6Mkn = 6H*,. Thus, we have our final 


answer 


|6Hkn|? 
E, =-) a0 _ co | 
mre Mer 


For visualization, we can expand the numerator out explicitly as well: 


(n) |6H] KO) (k©|5H|n©) 
EY = y (0) _ -(0) 
k#én E; E; 


So this numerator can be thought of as starting with our unperturbed state |n), transitioning to | (>) (through the 
second term in the numerator), and the transitioning back to |n) (through the first term in the numerator), paying 
the energy difference in the denominator. 


In summary, we've now calculated |n), to first-order: we know that 


5H 

— | y() kn (0)\ 4 2 
In), = | ) D> ao mo x‘) + 0002) | 
k#n —k n 


and we've also calculated the energy E,(A) to second order: 


nO [5H] KO) (k|5H|n) 
EO _ FO) 


En(A) = EM) + A6Han — 7 5° ( + O(A9) |. 


kAn 


We can already say some things about our results now: for example, we can try taking n= 0. 


Proposition 16 


The first-order approximation to the ground state energy is always an overestimate (for any value of A): 


E©) + X6Ho9 > Eo(A). 


Proof. We know that 
EQ) + AbHoo = (0 HO) + a (05H|0) = (0H + r6H|o) = (0 H(ary]o), 


and now the variational principle (with [0° as our test state) tells us that this must be at least the ground state 


energy of H(A), which is Eo(A) as desired. (And as a reminder, we prove the variational principle by expanding out 


the original ground state |o) as a linear combination of the new eigenstates. ) 


This is in fact consistent with the Taylor expansion above: notice that the second-order correction for the ground 
state EC) is always negative, because the numerators (squared norm) and denominators (differences to the ground 
state energy) are always positive, and there is a —\? in front. But it's more powerful because we don't need to assume 
that we have small A! 

And now motivated by that discussion of positivity, if we turn our attention to arbitrary n, we have 


lO Hkn|? OH kn|? lO Hkn|? 
EY = » 0 0 S$ 0 0 +> 0 0 
A  e-P Lw—P 


(we've adjusted a few signs in the second term). So energy levels above n contribute a negative second-order correction, 
pushing it down, while energy levels below n contribute a positive second-order correction, pushing it up. So the levels 
kind of repel each other, and this principle of level repulsion will come up some more in the rest of this course! 

So now that we've figured out the correct expressions, we'll turn to discussing the validity of this approach: when 
is it okay to write everything out as a power series? Basically, what does it mean for the operator AdH to be small, 
and how small does it need to be? 


We'll do a simple case with a 2-dimensional Hamiltonian to study this: 


Example 17 


Suppose we have the 2 x 2 matrix Hamiltonian 


ae 


HA) = HO + 6H = 


This is a problem which is simple enough that we can get the exact eigenstates and energies, and we can then 


compare the answer to perturbation theory. The exact solution comes from diagonalizing the matrix: we have 


4 6) 1) _ p10) ave 
EOje SS geek ae 


2 2 ( £0) _ pO ) . 
2 


AO +00 , eee 


We'll write this expression as 


Ex(A + f 
20) == F(z), 
where z= —*“l and f(z) = V1+2Z2. But the perturbative solution is basically a Taylor series expansion of this 


(EYE )/2 


10 


exact answer around z = 0, meaning the Taylor series expansion is the expansion of f(z): 


ze 76 zB 
1+z2=14 aes Gee O@). 


we can ask what the radius of convergence is for this series. This function is analytic around z = 0, but there are 


branch cuts at z = +/ (distance 1 away from 0), so complex analysis tells us that this will converge for |z| < 1, 
meaning that we require 

(0) (0) 
Ej — & 


A|V 
Vi< 2S 


Fact 18 


Generically, we're saying that all of the off-diagonal matrix elements of the perturbation 6H (in the original 


eigenstate basis) must be much smaller than the energy differences for our perturbation to be valid, and this 


ends up being a general criterion. 


Going back to the more general results we've obtained, recall that the first-order correction to the state gives us 


omreO — here, the numerator is the matrix element of the perturbation, and indeed we're asking for 
kon 


that to be much smaller than the denominator. But the diagonal entries are much more benign — if our 6H were fully 


coefficients of 


diagonal, then we would actually preserve our eigenstates, and there wouldn't be as many issues. 


And now we're ready to apply this formalism to an important example: 


Example 19 (1D anharmonic oscillator) 

Let HO) = es aE $ mux? be the Hamiltonian of the simple harmonic oscillator (in which the frequency of 
oscillations is classically independent of the amplitude, meaning that the energy levels are equally spaced in the 
quantum treatment). We add a small anharmonicity proportional to x*, which will make the frequency depend a 


bit on the amplitude (classical) and make the energy levels not exactly equal (quantum). Specifically, let 


NOH = hc eh 


where d = fe is the length scale (dictating the width of the Gaussian for the original ground state). 


Our full Hamiltonian is then 
je ieee (*)’ 
(A) = 5 + pmax" 4 wa): 


and we want to do perturbation theory on it. A good strategy here is to use raising and lowering operators: 


¥ = a(a+ a‘), 


so we Can rewrite ; 
dbH= qhula seal. 


We'll start with the first-order energy correction to the ground state: 
1 
B= (0054/0) = Shu (0|(a i aty*|o ) 
Using the algebra of a and al, we find that this energy coefficient is 3hw, meaning that we estimate 


1 3 
E9(A) = ah + AZ fw + O(d?). 


11 


We can then calculate the second-order energy correction as well, meaning that we want 


|6Ho|* 
EY =-)S 
0 0 
k #0 e )— ES 


We know all of the unperturbed energy levels EF) = (k + 5) fw, and also 
1 
BHyo = hs (K|(a + a!ys|o : 


In principle, these kinds of problems can be hard, but in this case only finitely many terms can contribute to the final 
energy coefficient, because we can only raise 0 at most four times! And the states that we can get out of x*|0) must 
all be even wavefunctions, because x* and |0) are both even. So that means that the only states that contribute at 
all are k = 0,2, 4. Indeed, we find that 


(a+ at)*|0) = 2V6|4) + 6V2|2) + 3]0), 


SO 
fi fi fi fi 
5Hao = 7 2v6 2 > ve, oe a 6v2 = 3v3. 


(Remember that the second-order correction only sums over k 4 0.) And now plugging these expressions in, 


NSD 2 
e(2) — __loheol? |5Haol2 (Pu) (32) (tw)? (8) =e 
o 7 ES) = 0) 30) = £0) = hu ih oar is 
And our answer now is , ; - 


People have in fact calculated this series to high order: the next terms are 3333 and — 305 4 And we can tell that 
the coefficients here are actually growing, but in fact we find that the radius of convergence is 0. So this is our first 
example of an asymptotic series — the way we use these is that for > small, the first few terms will still look like we 
have convergence, so we just cut off the series at some point (and there are also techniques for resumming this series). 

The reason this method breaks is that Eo(A) isn’t actually analytic near ) = 0, but that doesn’t necessarily mean 


~V/ near X > 0, which is not analytic but goes to O very quickly). And 


it has to be large (for example, consider e 
we could have guessed from the start that we wouldn’t have had an analytic function, because the potential would 
have been unbounded from below if A were any negative real number, meaning the ground state energy wouldn't be 
well-defined at all! Many other physical examples also lead to these kinds of asymptotic series, and we should keep an 


eye out for this in the future. 


4 February 22, 2021 (Recitation) 


(We'll usually start recitation with questions and clarifications about the problem set, but | won't write those questions 
and answers down here. ) 


This problem we'll do today is similar to the ones that we'll encounter on our homework: 
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Problem 20 


Suppose we have two square wells (where one is a perturbation of the other). Specifically, let the starting 


Hamiltonian be H() = + Voc-well, where Vic-wel! is the potential which is 0 for x € [0,L], and consider the 


perturbation 


—Y 0<x<L/2 
AOH =X 
0 otherwise. 


We know the solution for the (unperturbed) infinite square well already: since the wavefunctions must vanish 


outside of the range [0, L], the normalized eigenfunctions take the form 


nen? h? 
2mL2 


is different from the height from L/2 to L: we want to know what this perturbation does to our energy eigenfunctions. 


for n> 1, with energies F() = 


. And the perturbation changes our square well so that the height from 0 to L/2 


Our goal (in breakout rooms) is to find E and yy (the perturbation to the ground state). 


Solution. Recall that the formulas for the coefficients are 


EO = (n]54]n), nl?) = “Sw Hen 


(these are the same formulas as from lecture, but we're using notation like yo instead of [n®)). Because we don't 


have a matrix representation of 6H, the easiest way to calculate the matrix elements is to integrate directly: 


(7a) v6 ["Fan(°2)] 


(We can compute this using symmetry arguments, or we can use the power reducing formula cos2x = 1 — 2 sin? x.) 


t AM which is exactly half of the well decrease. 


Vo 


Giese 
a: 


In other words, all of the energies E, decrease by the same amoun 
(This makes sense, because particles spend half the time in the left side and half the time in the right side, no matter 
which eigenstate they're in, since the squared (unperturbed) wavefunctions are all symmetric about 5). 


Next, to find the perturbation to the ground state, we need to calculate the values of 


Lj2 2 k 
1 (K|54|1) = -V | 7 sin ( *) sin (=) dx. 
0 


Doing some manipulations gives us the answer for any k > 2, 


2k ku 
ml —F) cos 


Vo 


and this is nonzero only when k is even (in which case our answer is (—1)*/?). The idea here is basically that the 
even wavefunctions are antisymmetric around 5, but the odd wavefunctions are symmetric around 5. So the overlap 
from 0 to 5 will be the same as the overlap from & to L for odd k, but the total overlap is 0 (because our starting 
eigenfunctions are orthogonal)! Therefore 6H, is zero for odd k. 


So returning to the expression, 


(1) ) OA 
YO) = = Vi ) Fw (0) 
kl k Fy 


13 


(0) 


we can now plug in dHxi and E,-’ to get our answer: since E) — £0) = (k?-1)2% 


2mL?? 


Gis (0) 2kVo 2mL* 
wr) =— do vee) ( m(1— Ry *) mf? (k2 — 1)’ 


k even 


we have 


and we could simplify this a bit more but won't get too much insight out of it. 


We'll pose another question as well, related to the harmonic oscillator, just to remind us of some relevant formulas 


for the problem set (even though we don’t have time to solve it yet): 


Problem 21 


Consider the harmonic oscillator H) = = = me for a charged particle, placed in an electric field in the 


x-direction, so that our perturbation is 
AOH = —dAqEx. 


This is a linear perturbation (we talked about the cubic and quartic perturbation in other parts of the course). 


We'll think about the same quantities Eo) and |n@)) next time, but the useful harmonic oscillator facts to remember 


are the following: 


- We can write X in terms of the raising and lowering operators 
X= (ata), p= —(al—-a), 


where d? = an and we can check our work using that [a, at] = 1. 


+ We can rewrite the Hamiltonian as HO = fw (atat+ 5), so that 


1 | = (143) mala 


¢ The action of the raising and lowering operators on the energy eigenstates is 


at |KO) = VRFI|(kK+N), alk) = Vk|(kK- 1). 


5 February 23, 2021 


Last time, we were discussing the ideas of time-independent perturbation theory by considering Hamiltonians of the 
form H(A) = H© + 6H. We've been talking about energies that are non-degenerate so far in the first two lectures, 
and we solve the problems by assuming we can do power series expansions in A. 

Today, we'll discuss degenerate perturbation theory: we already know that we'll have some more trouble because 
the energy differences E°) — Eo in the denominators can now be 0. So before we get into the formalism, we'll do a 


quick example: 


Example 22 


Consider a Hilbert space spanned by two states, and take the Hamiltonian to be the 2 x 2 matrix given by 


1 0 01 aN 
H= He). 6H = +) = 
01 1 poe 
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The unperturbed Hamiltonian H© has two energy levels, and they’re both degenerate: we have EO = EO) =" 


: ; 1 0 
with eigenvectors , . 


Now let's forget everything we said about non-degenerate perturbation theory not working, and let's blindly apply 
the formulas and see what we get. Then the energy corrections to first-order, according to non-degenerate perturbation 
theory, are 

E,() = EO? + 6A =14+2A-0=1, 


E>(d) = ES + 6H =14+2-0=1. 


So it looks like the energies don't change to first order, but this is not actually true: in such a system we can solve 


directly for the eigenvalues and eigenvectors for F | they are a ] with eigenvalue 1+ A and B | with 
eigenvalue 1 — » (because the matrix is 1+ Aa,,). So the eigenvalues do split at first order, and something is wrong 
with our non-degenerate treatment. 

Also, the unperturbed eigenvectors that we wrote for H©) above are in a completely different direction from the 
perturbed eigenvectors for H(A) that we directly calculated. So if we're not careful, it looks like the eigenvectors 


“jump” once A becomes positive (even though all of the energy levels and eigenvectors are supposed to be continuous). 
: : 1 0 : : 
But what's really happening here is that when we chose ; and | above, that was only one potential choice for 


our eigenvectors — any linear combination of those two is also an eigenstate because we have a common eigenvalue! 
But for any positive A, because our two energy levels split and become nondegenerate, we can't allow for an arbitrary 
combination anymore. So determining what the split state reduces to at infinitesimal is a key step in degenerate 
perturbation theory. 

So now we're ready to dive into the formalism: we're starting with a set of energy levels that are N-fold degenerate 


at some energy E(), In other words, this means we'll have energy levels that look like 


0 0 0 0 0 
£0 << ons < EP = 0) = Ey < Ey Soe 
Also, we'll use an orthonormal basis |n©, 1), [n©, 2) attr [n©, N) for our states at this energy level E so that 


H) jot), ky = £0) ja), ky, (nl), k|n(, e) =. 
These N states span the degenerate subspace 


Vy = span {|n, L) ye ; [a nt 


so that any linear combination of these N states is also an energy eigenstate with the same energy EO) And the rest 


of the energy eigenstates span the rest of the Hilbert space H, in that 
H=VyeVi 
for the orthogonal complement V, spanned by all other energies: 
V, = span {|o) [pF nf : 


We should notice that we're using a different labeling for states in the degenerate subspace and otherwise: only one 
label p for everything outside our degenerate subspace. And for all such states |p», E©) is always different from 


FO) so (pO |n©, k) = 0. And we'll also additionally assume that we have orthonormality of those states, meaning 
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that (p'0)|q)) = dpg. 
So now we can turn on the perturbation A: we want to understand how the states evolve, and usually they will split 


into N states of different energies, which we'll label as |w=1) , |Wa2),--- , |Win). These eigenstates must satisfy 


H(A) |W), = Er(A) |W), 


and our goal is to find E;(A) and |q,;), — we'll do this again by trying to find a power series expansion for the state 
and the energy 
Ibi) = fw) + awh) +a? |W?) +>, 


E)(d) = EO +60) + 7EO 4... 


As \ — OT, we can indeed check that the energies approach the perturbed £0), but the states will approach jv). 


We know that lui?) must be some very particular eigenstates of the unperturbed Hamiltonians, and we don't quite 


know what they are yet: we Just know that 


N 
HP) = SHA) 
k=1 


for complex coefficients a;,., because each of those eigenstates lives in the span of our original [n©, k)s. So we need 


to find the a/xs to find the Jwf”)s, which we'll call the good states. We know that for any positive X, if the energies 


split, the good states must form an orthonormal basis. So we have (vie?) = Ojj. 
So now we're ready to do a similar strategy as in the non-degenerate case: plug our solutions into the eigenvalue 


equations, and expand them order-by-order. We'll then find that at zeroth order, we have 
(HO — £0) lyf) =o, 


at first order At, 
(HO — £1) ly) = (EM) — 5H |w), 
and at second order )?, 


(HO) — E10) lpi?) — (£9) — 6H) |wi)) + EG? fof). 


These are the same as before, Just with some extra subscripts, but let’s see what they mean this time. Again, the 
A = 0 equation doesn’t tell us anything new — we already knew that yo is in the degenerate subspace. So we turn 
to the next equation, and the first thing we need to do is to determine these good states jw) (compare this to 
the non-degenerate case, where we already knew the zeroth order state because there was no ambiguity). So we'll 
determine re) by hitting the A+ equation with (n, | for arbitrary 2€1,2,--- , N: 


(1), e] A) — EO yl) = (nf, eC? — SH] yf). 
Just like before, applying H©) — £o to the bra on the left-hand side gives us 0, and then rearranging gives us 
(10), 54] ¥() = EL) (n, ely). 


But we parameterized our yo as the linear combination 4 |n®, k) ak, SO plugging that in gives us 


N N 
3 (n,el6H|n®, k) ape) y- (nf), en, k) ark 


k=1 k=1 
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But our [n©, k)s are assumed to be orthonormal, so the right-hand side collapses to only have the term where k = é: 


N 
S- (nf), e)5H|n, k) alk = Be aig 
k=1 


We'll now make this more transparent by rewriting the matrix elements 
[OH]ex = (0, e)54|n, k) : 


here, k, 2 run over the states in the degenerate subspace that we already know, and then 6H can be thought of as an 
N x N matrix whose entries are exactly those elements above. (Basically, we're just looking at 6H projected to Vy.) 


And this re-interpretation as a matrix means that 


N 


i 
So[5H]ex are = EM ar. 
k=1 
ai 
a ; : a/2 
Here, / is just some fixed index, so a; can be thought of as an N-component column vector a; = _ |, and the 
ain 


left-hand side can then be thought of as a matrix multiplication 


([5H] - ap = EO) (are => | [SH]ar = EC) ay |, 


So the coefficients of a; must be the eigenvalues of the N x N matrix 6H, and thus the first-order corrections are just 


the eigenvalues of that matrix! And those NV different resulting eigenstates will be the N good states lw). 


Fact 23 
If the eigenvalues of [6H] are indeed distinct, meaning that a x ES for all / A J, then we say that the 


degeneracy is lifted at O() (the energy levels “leave’ > = 0 with different slopes). We hope for this situation, 
because then the a; eigenvectors are unambiguous and thus we've found the “good states” already. But it’s possible 
that some of the eigenvalues of [6H] are the same, and thus some energy levels don't split off until second order 


(or not at all). 


Often this is because there is hidden symmetry in the problem, but regardless, we don’t actually know all of the 
good states in this case just from this calculation (because EW) = ay implies that any linear combination of a; and 


ay will be an eigenvector). So we'd then need to look at higher-order A, and we might do an example of this later on. 


Fact 24 


What we've basically done above is diagonalize [6H] using the good basis, meaning that (wv? lse4|y() = EPO. 


To see that, note that 


(rs) = 


N WN 
k=1 £=1 
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where we've turned the double sum into a matrix multiplication, and this simplifies to 
1 1 
= En) yar = En) 51 


(by choosing our a;s orthonormal). In general, 6H and H will not commute, so we can’t actually simultaneously 
diagonalize the two operators. But the point is that [5H] is only a block of the full matrix 6H: if we use the good 
basis for Vy, the block of the full 6H corresponding to Vy will be diagonal, but that doesn’t mean that the good states 
are actually eigenstate of the full 6H: they’re only eigenstates of 6H restricted to the Vy subspace! After all, the 


components of the matrix 6H might have some contributions between Vy and V,: a resolution of the identity gives 


suf) ~ (3%) (oP +o) (om) ao). 
a p#n 
and evaluating this gives us 


= Soe) (oP [5H] ¥I?) + > fp) (o5H] i) = EO? Ly!) + Sp) (o5m|y)). 
’ oer pAén 


In other words, the first term behaves as we want it to, but the second term makes ae) not an actual eigenstate. 
Finally, we'll elaborate on this point next time, but solving [6H] - a; = Ee ay can be difficult if we have a large 
degenerate subspace. So we'll examine how to use symmetry to simplify some work and get the eigenvectors without 


having to do very much! 


6 February 24, 2021 (Recitation) 


(As usual, problem set discussion is omitted.) 
Today, instead of looking more at the harmonic oscillator, we'll discuss the Van der Waals force, which is a way of 
thinking about how two neutral hydrogen atoms interact with each other (at a higher order than the ordinary Coulomb 


force). The idea is that we have two atoms A and B which classically look like the diagram below: 


lA 


yu 


p 
A 


We'll let the horizontal axis be the 2-direction, and we want to look at the limit when R = |R| > |7], |73| (where 
Fa and rg have typical length approximately equal to the Bohr radius a9). To do perturbation theory, we're saying that 


our original Hamiltonian treats the two atoms as if they were completely separate: 


(HB _@),(&_# 
0 T p 
2m fa 2m fg 


Since the Hamiltonian is a sum of two terms, the solutions for wavefunctions are just direct products |ng, £4, Ma) |Ng, ge, MB) 


of the individual hydrogen atom wavefunctions, and the energies are similarly 


e? 1 e? 1 


2a) 1% = 2an N® 


(we're ignoring spin for the problem). So now if we want to do perturbation theory, we need to account for the six 


different pairwise electromagnetic interactions between the four charges. Two of them are already being considered, 
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and the remaining terms are 


2 2 2 


e e e 
R|R+ie—fl |R+7el |R—Fal 


To use this to solve our problem, we're going to need to do some expansion of the distances (which gets a bit 


dH= 


complicated), which results in the expression 


2 2 2 2 


e 
~ RS /ixgp— xa)? + ve —ya)2 +(R+28—zay?-/xB +2 +(R4+z8) Jxa + Vn + (R+ Za)?’ 


e e eC 


6H 


where we're letting the A proton be at (0, 0, 0) and the B proton be at (0, 0, R) (and then (xa, ya, Za) and (xz, Ye, ZB) 
are the coordinates of the two vectors Fa and fg. Then expansion of the square roots (in Taylor series) gets us the 


expression 


ee & 1 BTA, 3(Zg—Za)? 1 (xa — xB)? + (Ya — ye)? + (Za — 28)? 
R 2 R2 2 2R2 


2 2 
3 ZB 3 Z4 


e? Zp 1 e? ZA 1 
= (1 aoe spa + ¥B + 22) q (1 ee spa 0h + vat z8))) 


The overall system is neutral, so there shouldn't be a long-range potential, and indeed we can see here that zeroth 


and first order terms in the x, y, z all cancel out. So if we expand out those second-order terms, we get a pretty simple 


expression for the perturbation: 


R4 


€? 1 
0H |= RS (xaXB + Ya¥e — 2ZaZB) + O (Gx). 


We can then rewrite this in the dot product form 


2 
CS By Beep es 


We'll now treat f and fg as quantum mechanical operators (they're decoupled from the point of view of H()), and 
now we're getting close to the van der Waals force (it’s really a dipole-dipole force, so we could have written this in 
terms of electric fields instead if we wanted). To do our perturbation theory, we'll think about the ground state: the 
first order shifts look like 

(100; 100|F4 - Fg|100; 100) 


(where the semicolon means we have a product state), and because the A and B parts are independent, this can be 
written as 
(100|F4|100) - (100|7|100) . 


Remark 25. For more mathematical rigor (because f4- fg is a dot product and not separated on its own), we can first 
write this as the sum }_; (100; 100|riq - rig|100; 100) and then separate as >; (100|r;q|100) (100|rjg|100), and then 


rewrite that as the dot product above. 


But remember that the ground state is spherically symmetric, so the expectation value of f or fg is zero! (Alter- 
natively, we can think about parity — the ground state is even, while the operator is odd.) And similarly, this type of 
argument makes the second term's matrix element also zero, and thus the entire first-order energy shift is zero. 


So we have to go to second order, and we end up with 


| (N|SH|100; 100) |? 
Ey ~ » 0 0 
EW ~~ Eye 


N not ground 
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(where we've moved the negative sign into the denominator). Because the ground state is non-degenerate, we notice 
that E) will never be the same as EO, so our denominators will always be nonzero. Unfortunately, calculating the 
matrix elements in the numerator is pretty hard, so we'll just say a few things about them. We'll get something similar 
to the (100|F4|100) - (100|7|100) term from above, but with various excited states on the left instead of the ground 
states, and we need to excite both atoms to get something nonzero. Instead of calculating directly, we'll just estimate: 
since —E) > —£0) for all states N (this is the minimal amount of excitement that we can do on the atoms), we have 
Fem > FEO for all N, and thus we can estimate this expression above by removing the complicated weighting 
in the denominator: 


3 | (N|SH|100; 100) |2 1 


Fe) 2 = 
‘ Et) — Ef Ey +E 


o S © (100; 100/5H|N) (N|5H|100; 100) . 
N not ground 22 N 
And now because we have a sum over different eigenstates N, we can use completeness: adding back the cases that 


vanish, we get the operator >), |) (N| = 1 appearing in our sum, and thus we're just left with 


1 


= = —~ (100; 100|(5H)*|100; 100) . 
ED - Ef 


And now we can calculate this for real: 


1 et 


= . 2 ‘ 
= 50 EO Re (100; 100] (x4xe + YaYs — 2ZaZB)°|100; 100) , 
11 22 


and now this is an overlap of hydrogen wavefunctions and we can do the integral explicitly. It turns out the answer we 


end up with (after substituting in all of the energies that we know) is 


1 ef éag —| — 8e7.ae 
-2(1-2) (3) ®° = 


It turns out the exact answer (without the approximation) has a factor of —6.5 or so instead of —8, so we did a pretty 


le 


good job. And the important thing is that we have a potential that goes as ae between the atoms, and that’s exactly 
the van der Waals potential (that’s why the force is proportional to a where R is the distance between the two 


protons). 


Remark 26. Notice that even though our ground state is not degenerate, there are other (excited) states that are 
degenerate, so we proceeded without thinking about diagonalizing 6H. But this is okay, because we get the same 


answer whether we use the diagonalized basis or the original one. 


For example, suppose that |1) ,|2) ,|3) in a system are degenerate at energy EO, but |0) (the state we're trying 


to estimate) is not. Then our second-order energy correction looks like 


1 
_ (0 |5H|m) (m 64/0 ) + ae 


~ (0 0 
EY Ey gas 


and the key reason that the sum over m = 1, 2,3 is okay (even if we don’t use the “good states” mentioned in lecture) 


is that the change of basis which gets us to the good states | m/) 
|) ) = S- Umm’ mi) 
m 


has a unitary matrix U (meaning U'U = /), and under any such transformation we have (check this by plugging in our 
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expression for | m/) from above) 
En) (=F ot) (ot 
m ny’ 


So as long as the state that we're trying to estimate is non-degenerate, we don’t need to worry about degenerate 


perturbation theory! But we'll get into the details of that in the future. 


7 February 25, 2021 


Professor Metlitski wants to make sure we all know about the gather.town space that was set up by Professor Stewart 
— it’s a good place for us to work on problem sets and go to office hours, and the link is on Canvas. 

Last time, we continued studying time-independent perturbation theory, starting to consider the case where we 
have energy degeneracies. In other words, we start with an energy level E) which is N-fold degenerate, and we study 
how it evolves as the perturbation is turned on. (Generically, the degeneracy will split into N different states.) These 
states |q,) must satisfy the Schrodinger equation, and we looked for a power series expansion in » for the perturbed 
states. And remember that the main difference is that we don’t know the zeroth-order states or good states at 


the beginning: instead, we pick out those particular eigenvectors via 


N 
jw) = > [n, k) alk, 
k=1 


where the coefficients a;, are obtained from the eigenvector equation 
1 
[dH] a, = Ee), 


where the matrix elements are basically the perturbation restricted to the degenerate subspace: [OH], = (n©, £|5H| nO), k). 
At the end of last lecture, we mentioned that diagonalizing this [6H] matrix (which is smaller than the 6H full 
matrix) can still be difficult, but we can simplify our lives using symmetry in the Hamiltonian. And that’s what we'll 


be doing in today’s lecture! 


Proposition 27 
Suppose there exists a Hermitian operator K such that [H, K] = [5H, K] = 0. (So K commutes with H® +6H 


for any X.) Then we can choose eigenstates of H©) specifically those in Vy, that are also simultaneously 


eigenstates of K. If K |n®, p) = wp |n©, p), then [5H]pq = 0 unless Wp = bg. 


In other words, using a basis of states which are eigenstates of K will “split up” Vy into different spaces based on 
eigenvalue of K, meaning that [6H] takes a “block diagonal form.” And this can make fully diagonalizing [6H] much 
easier, especially if all of the eigenvalues fp, are different (in that case, [n®, p) are actually already the good states 


we're looking for!). 


Proof. Since [6H, K] = 0, we know that 
0= (nf, p[oH, k|n, q) = (nf, p|bHK = K5H|n, a) 
But because our bra and ket here are eigenstates of K, acting with the Ks (to the right and left, respectively), we get 


0= pg (0), plot] n, a) — Ly (00), plot] n®, q) = (Lg — Lp) (0), plot] n, qa), 
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so either Wg = Lp or the matrix element (n, p|dH|n, q) vanishes, as desired. 


Fact 28 
The matrix K often has to do with symmetry in the Hamiltonian: for example, K = L, works if we have rotational 


symmetry, and other common choices are [2 and p. 


Example 29 

Consider the case N = 3, where we have three degenerate energy eigenstates [n©, 1) ; [n©, 2) ‘ [n©, 3) chosen 

to be eigenstates of K. Suppose that they have eigenvalues [1, 1, 2 with “4, # Le. Then 6H in this basis 
fe ge 10) 

will take matrix form |* »* Of (with all *s arbitrary, except that the matrix is Hermitian), and in particular 
0 0 « 


|n©, 3) is one of our good states (because it is an eigenvalue). The other two good states are then some linear 
combination of [n©, 1) and |n© 2). 


It turns out we can work out the higher-energy terms as well, and we can read or work out the derivation on our 


own. But the results are as follows. First of all, our second-order energy correction is 


Notice that we need to take matrix elements with the good states lv). and the intermediate states p must lie outside 
the degenerate subspace (to avoid denominator being zero). And the corresponding first-order state correction has 


a small difference as well: because our Hilbert space is a direct sum H = Vy @V_, we get contributions for each piece: 
1) = WP), a) 
Ve 


The first piece here is similar to the non-degenerate case, in that 


Vy 


(p° (0) 
e)L =-Leo gor PP) 
Ms ee ERE,” 


(remember again that we're using good states) is a linear combination of terms in V,, so this part of the correction is 


in V,. And the piece for Vy is rather different from what we're used to: 


(oP [5x |0) (0 [aH] y) 
EO a EO 


1 
Ww) => em 7 


( 
JAI En | p#n 


Notice that we're assuming in this formula that all of the states in the degenerate subspace split at first order, so 
1) 
J 
and remember that we don't even know all of the good states in that case! So we'll try to discuss that a little bit now. 


that the energy corrections EC are all different. If the degeneracy is not fully lifted at O(A), we have to work harder, 


For example, suppose that we have the diagonalized matrix 


EY 0 0 
[H}=| 0 EM? oO 
0 o EY 
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This means that two of our states exit X = 1 with the same “slope,” and the last one exits with a different slope — 
more concretely, the energies are E() + Ne and E©) + rESD to first order, and the good state corresponding to 
the latter energy is |, 3) (but we don’t know at order X what the good states a |n\, 1) +6 |n©, 2) are supposed 
to be). So then we have to use the second-order equation in A, and here’s the general strategy for that in the case 
when we have a fully unlifted degeneracy at O(X), meaning we have [6H] = EW .1 for the N x N identity matrix /: 


we construct a matrix M®) such that Ma, = EY) ais where (instead of [6H]) we have the matrix 


Mp =-> 


pAén 


(n, 2)6H|p) (p|6H| n', k) 
E) = EF) : 


This should look a lot like the second-order energy correction in the non-degenerate case, and in fact when the levels 
are non-degenerate this is exactly a 1 x 1 matrix whose entry is the energy correction! So everything we've discussed 
so far is consistent, and we can visualize this numerator in the same way that we described a few lectures ago: Mi?) 
describes all of the contributions from “jumping energy levels” from [n, k), then to |p, and then back to |n©, 2, 
over various energy states that aren't at the same level as |n©, k) and |n©, é). 

So if we diagonalize M) and find its eigenvectors, we hope that the eigenvalues are all different (meaning that 
we finally know the eigenvectors a; and thus we know the good states). But if some of them are still the same, we 


then go to third order, or fourth order, and so on. 


Remark 30. We'll apply this kind of formalism to two of the problems on next week’s problem set, which is now 


posted. 


That's all we'll do for the formalism of time-independent perturbation theory; we'll now apply it to a classic problem, 


the fine structure of the hydrogen atom. 


Example 31 
The usual Hamiltonian for the hydrogen atom is 


2 


yo) _ B _ & 
2m ip 


MeMp_ ~| 


where we're using Gaussian units (So no €9s) and m is the reduced mass — sq lies 
le p 


First, we'll do a bit of review: recall that the characteristic length of the hydrogen atom system is the Bohr radius 
2 


ag = —5 © 0.53 angstroms 
me 


(where an angstrom is 107° meters), with energies written in terms of ap as 


— @ 1 Ry 
" 2agn2 sn? 
for the Rydberg constant Ry = 13.6 eV, where n is a positive integer known as the principal quantum number. It's 


also useful to write things in terms of the dimensionless constant 


e? 1 
a= —F-—, 
fic 137 
because we can then write j 
e* met Bnd 
— == = ame 
do fi 
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where mc? is the rest energy of the electron. This means that the typical energies of the hydrogen atom are smaller 


=o 
19000 


We can also think about this system in terms of the typical momentum of the hydrogen atom: we know that the 


than the rest energy by a factor of a? & 


momentum will be on the scale of 
fi me 


pyr a Sg amc 
(because there’s a position gradient in the momentum operator and ao is the length scale). So the typical velocity of 
the electron is ac (about 1 percent of the speed of light), which is basically nonrelativistic, though there will be small 
corrections due to relativity not included in H. And figuring out those energy corrections will be our goal later, but 
we'll ignore relativistic corrections for now. 


Talking about the hydrogen atom spectrum in a bit more detail, recall that it is highly degenerate: for any fixed n, 


the energy E, = -£ 4 can have orbital angular momentum & € [0,1,--- ,— 1], and for any fixed 2, we can have 
azimuthal quantum number m € [—£,—@+1,--- , 4]. For any fixed n, we then have the angular momentum states 


|2, m) satisfying the eigenvalue relations 
[? \@,m) = #e(2+1)|é,m), Lz|é,m) =hm|é,m). 


So for each principal quantum number n, we have a total degeneracy of 


n-1 £ n-1 
Stl 1= etna. 
£=0 m= £=0 

For example, for n = 3, we have one state in £ = 0, three states in 2 = 1, and five states in £2 = 2, for a total of 9. We 
often return to £= 0,1, 2,3 states as being in the s, p, d, f states (this comes from the various orbitals in chemistry), 


and the wavefunctions for these various eigenstates are 


Q 
Wneml ~ (=) - (polynomial of degree N in LEW Y) (8,9), 


where Ym is the angular momentum harmonic and n= N+ 2+ 1. (In particular, the lowest energy state for each 


£ 
2 has N = 0, and N represents the number of nodes in the radial wavefunction.) The (<) is particularly relevant 


because the wavefunctions for all 2 > 0 vanish at the origin. In particular, the simplest wavefunction is the ground 


state 
e 1/20, 


W1,0,0 = 5 
Tae 


So returning to the degeneracy in energy, we should note that this is not a general property of central potentials 
V(r): we know that states form angular momentum multiplets (over all values of m), but in general states of different 
values of 2 do not need to have the same energy. What's happening is a four-dimensional symmetry, the conservation 
of the Runge-Lenz vector, and we'll see that relativistic corrections will make some of these degeneracies split! 

We know that the electron also has a spin, so the full Hilbert space should be a tensor product H = Vorbitais @ C2, 
with basis given by Wnem(7) @|t) and Wn2m(F) @ |L) (over various allowed values of n, 2,m), where |t) represents the 

$, 4) state and ||) represents the [5 —4) state. 

The idea is that spin has been ignored so far, because H(?) doesn’t do anything to the spin part of the tensor 
product space. So there's an additional two-fold degeneracy because of the spin, and thus we've now found that the 


nth energy level actually has a 2n?-fold degeneracy. 


Remark 32. /n experiments, we often apply an electric field or magnetic field to our system and see how the spectral 


lines change, and we'll see throughout the next few weeks what the effects of those external fields are on the spectrum. 


24 


Returning our discussion to the Hilbert space H = Vorbital x C2, we mentioned one choice of basis for this space 
above, which is the uncoupled basis. Basis vectors of this space are labeled as |n, 2, mg, m;) (where the last number 
here is the S, eigenvalue for the electron spin), and the idea is that the states here are all tensor products of the 
orbital and spin states. But as we start to study relativistic corrections, it makes sense to use a coupled basis instead, 
where we look at multiplets of the total angular momentum J=L-+5S from the combination of the orbital and spin 


angular momentum. Then we have eigenstates satisfying the familiar equations 
P |i) = PIG +1) Lim), Selim) = Aimy i,m) , 


where m; € [—j, -+1,...,J], and we have the angular momentum addition rules from 8.05: an angular momentum 


é-multiplet tensored with a spin multiplet is a direct sum of various j-multiplets: 


red ()e(-4) 


(except without the second term here when 2 = 0). So we construct our basis as follows: for any fixed n, 2, we have 


construct our coupled basis vectors 


[n, 2,j, mj) = S- |n, £, Me, Ms) Oe ie 
m,Ms 

for the Clebsch-Gordan coefficients OP aes And our coupled basis vectors are all eigenstates of the operators JF de, 
and also L2 and H) (because we formed these states out of states that have a definite 2 quantum number and energy)! 
But these new basis vectors are combining different values of mg and m, together, so they are not eigenstates of L, 
or Sp. 

This coupled basis is just another choice which turns out to be useful for the hydrogen atom when we take relativistic 
corrections into account, because L and S individually don’t commute with the Hamiltonian — only the total angular 
momentum is conserved! 


We'll use the notation nL; for our various |n, 2, /) states, where n and / are numbers and L is s, p,d,f (and so on). 


Example 33 


For £2=1,n= 2, we have six states in total: 3-2 in the uncoupled basis, and 2+ 4 in the coupled basis, labeling 
the coupled multiplets as 2P3/2 and 2P,/2. For £2 = 2,n = 3, we have ten states in total: 5-2 = 10 in the 
uncoupled basis, and 4 + 6 in the coupled basis, labeling the coupled multiplets as 3D3/5 and 3Ds5/o. 


We'll bring in the relativistic corrections next time and bring in the perturbation theory! 


8 March 1, 2021 (Recitation) 


We'll cover some more degenerate time-independent perturbation theory today and then start moving towards the fine 


structure of hydrogen. 


Fact 34 


The hydrogen atom has both bound and unbound states — the latter are often called scattering states, and they 


are necessary for completeness. But we won't need that explicitly for the problem set. 
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Recall that we were studying the Van der Waals force as an example of perturbation theory last time: we studied 
the energy shift of the ground state (which had no degeneracy) to get the a potential. And if we go back over the 


notes, we'll find that our calculation has some similarities to the problem set. 


Example 35 


Today, we'll continue studying the Van der Waals force by studying some degenerate states above the ground 


state. 


Recall that we can characterize our two-atom system by looking at the states indexed by (4, £4, Ma), (Ne, £8, Ne). 
The ground state of this system is (1,0,0), (1,0,0), and now let’s try to think about the first excited state. For the 
next lowest energy, we can either have (2,-,-);(1,0,0) or (1, 0,0); (2,-,-), and 2 can range from 0 to 2—1 = 1 for 
the excited atom. Then m ranges from —é to 2, so there are 3 states for 2= 1 and 1 state for 2=0. This gives a 
total of |2(3 + 1) = 8] total states at the first excited energy level: 


|100; 200) , |200; 100) , |100; 21m) , |21m; 100) 


where m € {—1,0,1}. So we want to know how the Van der Waals interactions influence these states, and to do 
that, we need to do degenerate perturbation theory. The first thing we need to figure out is how to diagonalize the 


[OH] matrix, and we do this by calculating the matrix elements 
(nalama, NelemB|dH| nlm; Ngeg Ms) . 


This looks like we have to do a lot of different calculations, because there are 8 states we can put on the left and 8 
states we can put on the right. So we'll start by thinking about where there are zeros in this matrix [6H]: remember 


that the 6H has a ra - fg component, so when we evaluate we end up with something that looks more like 
(nalamal fal lala) 


(times the analogous expression for B). But the 7 term in the middle is odd, so the only way to make this expression 
nonzero is if one of the states is odd and the other is even. And it turns out that when @ Is even, we get even 
polynomials in the wavefunction, and when @ is odd, we get odd polynomials. So that means we only get nonzero 
values if £4 and 2’, are of different parity — specifically, one of them must be 0 and the other must be 1 in this case. 
So if we want a nonzero overlap with |100; 21m), we need another state where we have |?1?; 70?) for some numbers 
?. And the only way this can happen is if we are computing the matrix overlap with |21m/; 100)! So that simplifies 
our calculations significantly — in particular, |100; 200) and |200; 100) have zero overlap with any of the states, and 
we just need to compute 
(100; 21m|6H|21m/’; 100) 


for various values of m, m’. And furthermore, the 6H doesn’t give a perturbation to the m quantum number, because 


(letting Laz + Lez be the total angular momentum in the z-direction) 
[5H, Laz + Lez| =0, 


because 6H only has terms of the form f4- fg or (f4-2)(Fe-Z), which are symmetric around rotations in the z-direction. 
And this tells us that the only way to get a nonzero matrix element is if m= m’! That's because we can compute the 


matrix elements of 0 = [6H, Laz + Lez], which will turn out to be things like 


0 = (100; 21m|[6H, Laz + Lgz]|21m’; 100) = (m — m’) (100; 21m|6H|211m’; 100) . 
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This means we end up with three 2 x 2 matrix problems (for m = —1,0,1) instead of an 8 x 8 matrix, and that 
breaks the degeneracy for everything except the |100; 200) and |200; 100) states. So this method gives us six of the 
eight good states, but we need to go to higher-order to figure out the other two. 

Let's focus on what we can compute, though: we get results like 


1 
(100; 21m|6H|21m; 100) = pak 


0 A 
for some numbers k,,, and that means each of the 2 x 2 matrices looks like ; 1 for some A. We therefore find 


kn 
RS 


that we have energies of | 4 with the respective eigenstates 


: (|100; 21m) + |21m; 100)) 


So first-order perturbation theory has given us an interesting result: we have a zB potential for Van der Waals, which 
is stronger than in the ground state and can be both attractive and repulsive (depending on the superposition). 


We'll now move onto another example: 


Example 36 


Suppose we have a spin-1 particle with original Hamiltonian H( = AS? and perturbation ASH = A(S2 — $2). 


The thing for us to remember about spin operators here is that the raising and lowering operators are useful: 


working with the operators 


Sz = 5S, +i5, 


is good for calculations, because we know that angular momentum operators give us 


Js [jm) = AiG +1) — m(m+t1)|j,m+41), 


and the Sy operators serve that purpose here. In particular, we can rewrite S, and Sy in terms of the raising and 


lowering operators to find that 


2 2 
ee (=) . 


2 


Problem 37 


Let’s determine the first order energy corrections Be for this perturbed Hamiltonian. 


We should first find the eigenstates of the original Hamiltonian H(), which are |1,1),|1,0), and |1,—1) (the 
eigenstates of S, and of S2 are the same). Since the eigenvalues of Sz are hm, the eigenvalues of H( will be A(fhm)? 
for m= —1,0,1 (meaning that |1, 0) has eigenvalue 0, and |1, 1) and |1, -1) both have eigenvalue Af?). 


For the nondegenerate state, the energy correction is the matrix element 


1.0). 


This turns out to be ae = 0| (because raising or lowering gives us no overlap). But it takes more work to compute 


5 = 6 
(1, 0|6H|1, 0) =a(1.072 


the energy corrections for the degenerate states, and we need to compute the 2 x 2 matrix [6H], which turns out to 
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O:. ae 
be P ar So that’s the same structure as the problem above, and thus the good states are again the sum and 


eT with eigenvalues | E) = +)n? |. 


One thing we'll need in lecture tomorrow is the addition of angular momentum: for example, if we have an 


difference of the original basis states: they are 


operator J=E+ Ss, and we have an£=2ands= 3 system, we should be able to check that the allowed total states 


lj, m) must have either / = 2 or 3, and mcan range from —j to / (as usual). We usually write this as 


2@x= 202 
an as 


Indeed, we can count how many states we have here: 5-2 = 4+6 = 10. We'll explore more about the Clebsch-Gordan 


coefficients and use them for the fine structure of hydrogen next time! 


Q March 2, 2021 


We started discussing the hydrogen atom last time, with the goal of understanding the fine structure. Recall that the 


Pepe eg 


mCc* 


unperturbed Hamiltonian H© = ca — = has exact energy levels E, = —5a which are highly degenerate, and 
we can describe the degeneracy either in the uncoupled basis |r, 2, ™, m;) (where 2 is the orbital angular momentum 
quantum number, mz, is the azimiuthal quantum number, and mg is the azimuthal spin of the electron). With this, we 
calculated last time that there is a (2n?)-fold degeneracy at each level n. 


But the other basis we can use is to label using |n, 2, /, mj), where we use eigenstates of J? and Jz (for the total 


angular momentum J = L +S), where j = + 5 (from the rules for addition of angular momentum). We label our 
states in the coupled basis as nL; (using spectroscopic notation), where n and j are the usual numbers and L is 
S, P,D, and so on (instead of 0,1,2,---). 


Our goal now is get closer to the correct structure of the hydrogen by introducing relativistic corrections. 


Proposition 38 
Our Hamiltonian and Schrodinger equation are not Lorentz invariant — to see that, we can just look at the 


free-space version 
Av2 
2m 


v, 


in which time and space enter in different derivatives (not on the same footing). 


So we're going to need a more general treatment, which comes from the Dirac equation (a relativistic upgrade of 
the Schrodinger equation). We won't discuss this equation much in lecture — we could use the full Dirac equation, but 
we only need it to give us small corrections to Schrodinger (because we know the electron is basically nonrelativistic). 


So our Hamiltonian will now look like 


52 aA 2 
p p 1 lids -s fi 5 

H= -L4 V. 
( ) Smee ~ 2mec rr : Bmece Y 


The three terms added are known as dH, OHs-o (spin-orbit), and 6Hp (Darwin), and together these are known as 
the fine structure perturbations. We can first estimate their magnitude: recall that the zeroth order piece H has 
2\ so they're down by a factor of a? compared to the rest mass. It turns out 


Let's 


typical energies proportional to a2mc 
that the other three terms here are on the order of a*mc?, so they're smaller by around a factor of a2 ~ +9900- 
check this now: 
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= 4 
> We have dH ~ goa ~ (amd =a*mc?. 


2 


+ Plugging in the potential V = -£, we have 6Hs.o = =¥s eS. [Tw ma Sh -fh. And now because a9 = 5, 
2 


mc? he fic 


of the potential V, shows the same thing for the Darwin term.) 


4 
plugging everything in gives us — fm h2 wm (¢) c? =a‘*mc?. (A similar calculation, plugging in the form 
So because we know that the corrections are indeed smaller than the unperturbed Hamiltonian, we can apply 
perturbation theory. Defining the fine structure perturbation 
OH = OHre + OHs-0 + OHD, 


we can write H = H() + 6H, and apply perturbation theory. Strictly speaking, we need to do degenerate perturbation 
theory, which involves forming a 2n? by 2n? matrix and diagonalizing the resulting [6H] matrix. We'll do this term by 


term. 


Example 39 


First, let’s look at the term 6H, = — soa 


We can think of this as a relativistic correction to the energy of a relativistic particle Exin = \/ p2c2 + m2c* = 
meey/1+ —_ And we're basically expanding the square root in a Taylor series as 


2 2 2 
=p i : a oa ee 
2mc2 8 \ m@c? 


The first term here is a rest energy (a constant, which doesn't really matter), and the second term is our familiar 


non-relativistic kinetic energy. So the next term after that is indeed the correction that we're making in dHye (and 
terms after that are of order a®, so we won't take those into account). 

To understand this term's effect on the spectrum, we need to compute matrix elements at energy level n which 
look like (in the uncoupled basis) 


(n, 2, me, Ms|OHrei|n, 2’, mp, M6) . 


But in fact, these elements all vanish whenever 2 = @ or my = m or ms # m. In other words, this matrix is already 
diagonalized in this uncoupled basis, so we just need to compute the diagonal entries! And this happens because of 
the following fact from last lecture: if we have [K, 6H] = 0 for some Hermitian operator K, and K |n©, dy =Aqg |n, q) 
for all g in the degenerate subspace, then (n, g|6H|n, q') = O unless A, = Ag. And we're applying the result here 


using the operators K = L?,L,, and S, — note that because 2 


is rotationally invariant, it commutes with angular 
momentum (2, L,, and therefore p* will also commute with them. And furthermore, the 6H; term doesn't act on spin 
at all, so it also commutes with S,. (And because choosing K to be 2, Lz, Sz give us eigenvalues of £(2+1), mg, ms, 
we can indeed only have equality if the quantum numbers line up.) 


So now let’s compute the energy shifts, which are given by the diagonal entries 
EMYrel  _ (8, me, ms|OH rein, £, Me, Ms) . 


n,l,me,Ms 


Nothing happens to the spin, so this can be rewritten as 


1 
= ~ Smace (Ba tiong| (OY eatin) : 
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But now we can apply one copy of f? onto the bra and one copy onto the ket, so this becomes 


1 


= ~ Bm3c2 (P'n.e,m|PWne.m) ; 


Furthermore, we know that Wp,¢,m, Satisfy the unperturbed Schrodinger equation 


pe 
& a v) Wn t,m = FOAL, ees 


so that 
Pedn.em = 2M(EO —V)Wnerme- 


So plugging this back in gives us 


1 


ied (2m)? —; 
ete, = 2: (EP —Vovnem _ 


n,l,me,Ms 8m c2 


(EO — V)¥nem) = (Wresme 


(EL) —V)?|Woem) 


(using that V is Hermitian). And now we expand the square out and plug in V = ae 


= 
1 
= = Sige (Ynem Vntm) : 


So all we need is the expectation value of + and > in our (unperturbed) hydrogen atom eigenstates, which is something 


e? et 
(E0))? — 260) (-5) vo 


we'll do on this week's problem set! It turns out that (+) = noe and (4) = FERSEe so the final answer for the 
2/70 


energy corrections due to dH ye is 


Gyre... eet me* An 4 
me Pee 8 nt \e+1/2 


Remark 40. Notice that the shifts in energy are indeed of order a*, as expected, and also notice that there is no ms; 
or mg dependence. The former is expected, and the latter is true because angular momentum commutes with 6 He. 


So we haven't fully lifted the degeneracy yet. 


So the uncoupled basis looks like a good basis right now. Unfortunately, we have two more terms, and once we 
add those in, we'll find that the fine structure good basis is actually the coupled basis! So let’s change bases and 
calculate 

(n, £5, mj|5H | n, £5", mm). 


Because the coupled states with some & are a linear combination of uncoupled states with that same 2, these matrix 
elements will still be zero unless 2 = £’. Remembering that the energy corrections we found above depend on n, £ but 


not on mg, Ms, that tells us that if we fix n, 2, the resulting matrix of energy differences Is 
1),rel 
[5H magmg.miimt — Ee cl 


for the identity matrix / of the right size. But the identity matrix is the same in any basis! So even in the coupled 
basis, for fixed n, @, 
[5H jm stm = Be, 


and thus we arrive at 
(nj, m,|SH"'|n, £1, jm) = E005 iS mm 


So both the coupled and uncoupled 6H! are good states, and the point is really that this term isn't acting on spin at 


30 


all. 


Example 41 


Next, we'll look at the Darwin term, which is 6Hp = aw, 


We know that V = =— and when we take the Laplacian of 4, we get a delta function —4163(F). So the Darwin 


term is actually proportional to the delta function: 


T 
2m 


dHp = (7). 


But the delta function picks out the value of a wavefunction at the origin, so 
OHp |n, 2, me, m;) = 0 if 240 


(the hydrogen wavefunctions vanish at the origin for states with nonzero 2). And the states with 2 = 0 are very simple 
— since mg = 0, there’s only a degeneracy from the spin term, and d6Hp doesn't act on spin at all. So the matrix 


elements of our Darwin term are 


(1, £, me, Ms|5Ho|N, £7, MEM) = be,06e,05m,06:my,05m.,m, (Yno0l6Hd|Wnoo) 


en Se eRe: - 
= usieobasbnstnn pc | rPnoo(7)O” (F)Wnoo(F) 
2h 


Te 
= 8¢,05¢',091m,09mm,,09me.m, 5 ma SF 1 noo(O)1 


It turns out that we have |Wnoo(0)|? = ee (we can check Griffiths to derive this), and thus the nonzero overlap 


on 
: > 1 ; ; ; ; . 
is |a*me a3 | So putting this all together, the matrix elements for the Darwin term are diagonal in the uncoupled 
basis, and they vanish unless 2= my = 0: 
1 
(1),D 2 
EV)? = af mc aa 


(where this energy shift is only for two of the states). And we could have equivalently said this fact for the coupled 


basis, because |2 = 0, my = 0, ms) is the same as \2 =0,;= 5, mn = ms) (since 0 @ $ is just 5). 


Fact 42 


The physical eee of this term is that this electron (which is initially treated as a point particle) is “fattened” 


to a ball of radius —- f (the relativistic Compton wavelength). 


And this happens because the Dirac equation describes both the electron and the positron, but we get rid of the 
positron when we simplify to Schrodinger. (An electron, in the presence of V(r), can “become a positron for a bit.”) 


But we can read more about this in Professor Zwiebach’s notes. 


Example 43 


Finally, we can study the spin-orbit coupling term 0Hs.o = 5 


This is the only term which actually involves spin, and it is the reason why we use the coupled basis! This term 
also comes from the Dirac equation, but intuitively, we can imagine an electron with some velocity around a proton 


(which we lock at the origin). In the rest frame of the proton, the electron only sees an electric field, specified by 


31 


deE = =VV, but in the rest frame of the electron, we see the proton moving, and thus there is also a magnetic field 


term, which we can figure out by doing a Lorentz transformation of the original electron field: 


 .. de wcge’. AB VV 1 p 3 
Bi = 7x E= Bx (2) - 2 x (Evy) 


de c-em 


(using that V(7) = V(r) is a central potential, so that VV = V/(r)- 7). But then — x 7=—/7 p, which gives us an 


angular momentum: 


And because the electron is stationary in its rest frame, we can understand how a magnetic field couples to this 


stationary electron — it does so to the electron spin. But more generally, a magnetic field couples to the magnetic 


dipole to yield H = —zg- B, because we have (classically) magnetic moment from a loop of charge given by ff = a) 
(if Fis a vector with magnitude equal to the area of the loop), and then if a charge q is going around a circle, it turns 


out that magnetic moment is proportional to angular momentum 


= q >, 
= ———7 an 
H 2mc 
because the period of the motion is T = eRe and then the current is / = 2 = 5%, so 


1 qv oe qvR qlz 


And even when the electron is not moving, it still has a magnetic moment from the spin, and we have 


q 
2mmc 


He = 9 S 
where g is a “fudge factor” which we know is 2.00231930436182(52) (to extreme precision), but we'll say it’s 2 for 


our purposes. So putting it all together, we get 


7) ee ee: ee ee 


me: Cc 


and thus our spin-orbit coupling term is indeed 
LL 
ObHs_.0 = aie (r)L -S. 


So we're off by a factor of 5 from the actual answer, and the actual source of the 5 comes from the following error in 
our argument: we went to the electron’s rest frame, but the electron's velocity is changing (because it’s undergoing 
orbital motion). So we don’t actually have an inertial frame (this is an effect known as Thomas precession), but 
the argument is close enough for us to get some intuition! Next time, we'll compute the effects of this term on the 


spectrum of hydrogen. 


10 March 3, 2021 (Recitation) 


We'll do a bit of review for the fine structure of hydrogen today, and we'll also explain why this method is relevant to 
other systems as well (in other words, is there any other example besides hydrogen where this kind of fine structure 


calculation is useful). 
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Recall that our Hamiltonian for the hydrogen atom is 


Hydrogen = HBohr +4 HFine-structure 


where our “unperturbed Hamiltonian” is H®ehr — - — and there are three different terms in the fine-structure 
correction (p*, i” Ss, and Darwin). It turns out that there are other corrections that we often make, which lead to 
interesting effects: for example, we have a hyperfine structure term HHFS which we discussed a little in 8.05, as well 
as terms from the Stark effect (giving us a X- E) and Zeeman effect (j- B). 


There are always more corrections we can make, and they get more subtle — just for a sense of scale, the Bohr term 


4 Me 


has magnitude a?, the fine-structure terms have magnitude a*, the hyperfine structure term has magnitude a ae 


and the Stark and Zeeman effects depend on the magnitude of the external field. But then we have a Lamb shift term 
which is a> Ina at first order, and so on. Basically, there is a very rich structure of hydrogen, and a lot of it comes 


from relativistic and quantum relativistic corrections! 


Example 44 
Let’s take a look at the hydrogen hyperfine structure, which involves a Hamiltonian which couples the electron 


and proton spin: 


(ofeepneeo ||| ieee HG IE = Sa. 


Sy i Sp 6°(7) 


TES ey 
Amempc? | 3 


Pe 


(where ge, Jp are the g-factors for the electron and proton). We'll only focus on the first term today. 


In 8.05, we derived this by noting that the magnetic moment is proportional to the spin (for both the proton 
and electron). And we solved the problem by taking the two spin-1/2 particles (proton and electron) and going to a 
different basis with § = Se + om and this is good because 


Sec (as <5) 


is especially easy to deal with, since S2 and S both give us f?se(se + 1) = $h?. So two of the terms here are 


proportional to the identity operator, and that means we'll write 
ee ss 1 3 
Sey [s°- S11] 


We can then look at the combined spin states of the proton and electron: since 5 ® 5 =0@1, we haves=Oorl, 


and the eigenvalues of S? are then either 0 or 2h?. So what we did in 8.05 was to find the first-order correction 
EO). = (n,£, m, 5, Ms|Hues|n, 2, Me, S, Ms) 


(we should have (0) superscripts on both of the states above). We'll specialize to 2 = 0, and this energy correction 


then turns out to be 


2m JeGpe? —3r s=0 


= |Wrem(0)|? - 


3 Migm,C 


(where the different cases comes from the total spin S). 


Now that we've reviewed this material from 8.05, let’s consider a slightly different system: 
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Example 45 


Suppose we replace the proton in the hydrogen atom with a positron e*. (The positron and proton have the 


same spin.) Then the idea is that the positron and electron can form a bound state called positronium. 


The equivalent Hamiltonian to the Bohr term for this system is 


where mp is the reduced mass of the system. This is basically just the electron mass in the hydrogen atom, but the 
mass of the electron and positron are the same, so the reduced mass here turns out to actually be mr = *. But 


* 4 F 2 
other than that, we have a very similar structure as hydrogen, so we can write down our energy levels (where a = ;-) 


a?c?mp 


(0) _ 
= 2n2 


So the first energy level is half of what it was before, and our Bohr radius aj = aS is twice as large as it was for the 
hydrogen atom. 

But there are other things that are different about this system when we replace the proton with a positron: for 
example, the two particles might annihilate each other, which means that our bound state might not even form. This 
concept is beyond what we can calculate right now, but it turns out an electron and positron cannot annihilate to a 
single photon by three-momentum conservation (think about the center-of-mass frame where Pe+ + Pe- = 0). So we 
need at least two photons — it turns out that the s = 0 state (para-positronium) decays to two photons, and the 


s = 1 state (ortho-positronium) decays to three. 


Fact 46 


It turns out having extra photons costs extra factors of a, so ete” — yy (the s = O case) has a potential 


proportional to a>, and ete” > yyv¥ has an “effective potential” proportional to a°. 


So these are smaller than the H©®) energy levels, and actually even smaller than the hyperfine structure (which 
is actually the same size as fine structure because there's no 7 term), so transitions are unlikely (the states stick 
around for a while) and positronium does have a bound state for some time. (We can think of what's going on here 


as having “imaginary potentials,” which basically corresponds to a “loss of probability.”) 


Fact 47 


The lifetime of the ete + yy decay is on the order of 107!° seconds, and the lifetime of the ete > yyy 


decay is on the order of 107’ seconds. So these don't last very long, but they still exist for long enough to study. 


Ge _ Ge _ 2 


me Me Me 


So now if we return to the boxed formula for the hyperfine splitting above, the term a must turn into 
(the positron and electron have many properties that are the same, including the g-factor and mass). And it turns 
out that |Wp,2-0(0)|? = am = | p'yeregen(Q)|? (because our Bohr radius ap has changed by a factor of 2). So if we 
keep track of all of these different factors, we find that (again, we're only doing the 2 = 0 case) 


Ome Val fay - Oo, \ 5. ea oes 
MES = 37308 75 (3 Bl FOr) 7S 


where S = S.+ +S.-, but the calculation is the same as for hydrogen, and the 6°(r7)2S? is a new effect (that has no 


analog in the hydrogen system). This new term comes from quantum mechanics allowing virtual photons (energy and 
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momentum conservation can be violated for a very short amount of time), meaning that we get quantum transitions 
like ete~ + y* > et e~. And the spin S? term comes from the spin of the photon being 1 (meaning that s = 0 gives 
0 and s = 1 gives 2f*). So our story is a little bit modified, and we'll end with the answer for the first-order energy 


correction: 


81 —3n s=0 


(n,£=0,m=0,5, Ms|Hyes|n, 2 = 0,m =0,5, ms) = amc ane [Were-) (0)? - 


The fh? comes from +f? in hydrogen, plus the new 3f? effect in positronium.) So we've now seen the fine structure 
4 4 


in another system besides hydrogen! 


11 March 4, 2021 


Last time, we started talking about the fine structure of hydrogen — the familiar Coulomb Hamiltonian is the leading- 
order approximation to the hydrogen Hamiltonian, but relativity and the Dirac equation give us fine structure correc- 
tions. We've identified three such terms (the p* kinetic energy correction, the spin-orbit coupling, and the Darwin 
term), and we started using perturbation theory to figure out the energy shifts. We'll now finish this task for the 


spin-orbit coupling term and arrive at our final answer — the Hamiltonian we're now studying Is 


dHs.o = 


Recall that in the coupled basis, the Darwin and the kinetic energy corrections are both diagonal, and it'll turn out 
that using this coupled basis helps with the spin-orbit coupling term. This is because we can write S-L in terms of 


squares of angular momentum operators, and J=L +S is convenient for this: 
§-L= sFs = ¥. 


To understand why the coupled basis is good for this Hamiltonian, we can make a few observations. Notice that the 


spin-orbit Hamiltonian 6Hs_9 commutes with fi. the total angular momentum. We can check this as follows: first, we 


look at the angular momentum terms. We see that J? commutes with J because it commutes with each component, 
S? is just a constant 2h, and J=§+L commutes with L2 because § and L2 don't interact, and L and L2 commute. 


Additionally, the coefficients and 4 in front also commute with spin because they are rotationally invariant. Similarly, 


we can check that dHs_9 also commutes with F*| and | £2 (because LE commutes with [2, S and L2 don't interact, 


and so on). 

Therefore, |n, 2, j, mj) must be a good basis for this Hamiltonian, because we can simultaneously diagonalize 6Hs-o 
with the various angular momentum operators. (In other words, (n, Lf, m,|5Hs-o|n, Ly, m7) =Ounless£=2,j =/', 
and mj = m, because otherwise can apply dHs.o to either the bra or the ket and get different constants out as 
eigenvalues.) So we know that perturbation within this degenerate subspace is diagonal in our coupled basis, and we 


can read off the diagonal matrix elements to get the first-order energy corrections: 


nes, m) : 


Using that the vectors in the coupled basis are eigenvectors for (P? — [? — §?), we can have that term act on the ket 


1 Zs , 
Be Les) 


2 
(1)SO € . 
En tim, =~ Fpet (nt mj 
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to give us 
e*he 


= 72a (11 +1) —2(2+1) 3) (ni mj 


We now just need to compute the expectation value of 4, and to do that, it's easier to do so in the uncoupled basis 


1 
nes, m) 


r3 


first: it turns out (see Griffiths problems 7.43 and 7.44, for example) 


1: 
(Ynem Vnem) = 


nabe(l + 1)(+ 4) 
(This expression does not depend on mz, because it is a rotationally-invariant operator.) We'll call this expression 
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g(n, 2), and now we claim the full matrix elements are 


n,£, Me, Ms| 
£ 


ce ae m,) = g(n, £)d¢.06m, 


(similarly to our argument above, the off-diagonal entries vanish because 4 commutes with the various spin operators 
L2; L2, Sz, and n needs to be the same so that the states have the same energy). So now to get things into the 
coupled basis, if we fix n and 2, the expression g(n,£) is a constant, and the rest of the expression is the identity 
matrix. So making the matrix transformation from the coupled basis to the uncoupled basis doesn’t change that part 
of the matrix, and thus even in the coupled basis, 


1 
(n, £,j,M; 


r3 


n, Of m) = g(n, £)),7 9 my.m- 


(This is the same argument that was made last lecture!) So plugging everything back in and doing some arithmetic 


gives us 


(1).S-0 _ atmce? ( j(j+1)-ee+1)- 2 
He ES Ne Cea) | 


So now we can add up the energy shifts from the different parts of the Hamiltonian (noting that all of the energy 
shifts are diagonal in the coupled basis, making calculations easier). In the 2= 0 case, the Darwin term has no effect, 
and thus 


4 


. z (GI Sates 
FOL) fs = 15 Hre ee aoe a*mc LO J 4 
bi (n,£,j, mld Het + OHs-0|n, £, J, mj) gr (3+ 20 Wer ier) 


We'll call the bracketed term f(y, 2). It turns out that this expression actually simplifies nicely: because j can only be 


e+ $ or £— S, meaning that for any fixed /, we have 2=j/ + $ or £=j—- 5. And what we find is that 


. von 2 ; dee Jil 


So for the two allowed values of 2, we have a much simpler form of f(y, 2) — there isn’t actually any £-dependence at 


all! So this gives us our final fine-structure correction: 


(fs _ 4 9 1 n 3 
Eni, = 0 MC 57a (4 4} 


And if we try plugging in 2 = QO, it turns out that this formula still works when we add in the Darwin term. To 


understand this, notice that the spin-orbit contribution is 0 for 2 = 0 (because we have a [ term in the Hamiltonian), 


but we get a nonzero contribution from the Darwin term. And if we look again at the first boxed expression for 
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spin-orbit energy shift, we have an @ in the denominator — it turns out that taking the limit 


lim EO)S-0 ( =é+ 5) = Ee 
£0 2 


exactly reproduces the Darwin term for 2 = 0. So because of certain “magic” calculations, we get a nice expression for 
the fine-structure energy shift at the end of the day! 


Some of the most important takeaways from this calculation are below: 


* The coupled basis is a good basis for the fine-structure calculation (but the uncoupled basis is not). This 
is because J. the total angular momentum, is conserved in relativistic physics, but [ and S are not conserved 
because of the spin-orbit coupling term. (One way to understand this comes from the Heisenberg equation 


a —;A, H] for operators A — in other words, A and H commuting means that A is conserved.) 


Because the angular momentum J commutes with the Hamiltonian, all 2/+1 states in a multiplet have the same 


energy. So the energy shifts do not depend on 17. 


* For a fixed n,j, the states with different £ have the same energy shift. (For example, 25/2 and 2P,/2 remain 
degenerate, and fine structure does not split them. Instead, these states only shift at a higher order in a, and this 
occurs because of the Lamb shift giving O(a> mc?) energy splitting. But this is already a result from quantum 


electrodynamics. 


- All energies are shifted down by fine-structure, but states with larger y shift less than states with smaller /. In 


other words, E,,; increases as a function of / for fixed n. 


Fact 48 
A quick logistical point about our our term paper: our topic and proposal is due on March 26, and there is a 


description that we should read on Canvas. A template will be posted shortly, but it should contain a topic, 


student peer editor (whose topic is different from yours), a brief plan for the paper, and a list of references. 


Example 49 

The next perturbation we'll look at is to place this hydrogen atom in an external magnetic field and examine the 
Zeeman effect. One application of this is to measure magnetic fields in interesting ways: for example, spectral 
lines In sunspots split compared to spectral lines in other places in the sun, and that’s because of differences in 
the magnetic field (on the order of 3000 Gauss). 


The leading-order correction due to a magnetic field is 
6H =-f-B, 


where we get a contribution to the magnetic moment from both the spin and the orbital angular momentum {fg and 


Ls: 
~ oy Bf Pane B 
dHz = —(fe + ds): B= Smelt + 2S) - Bext. 
(The extra factor of 2 here comes from the g-factor of the electron.) We can choose our coordinates here so that 
the magnetic field points along the z-direction, and then our Zeeman perturbation is 


eB, 


H7= 
ore 2mc 


(Lz +2S;z). 
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So now we want to understand how the Zeeman effect affects the spectrum of hydrogen, but our Hamiltonian is now 
H = HO) + 5H, + 5H 7. 


We now have two different perturbations, so in principle we need to calculate perturbations for the fine-structure 
term plus the Zeeman term. But there are two limits where the calculation is easier. The first is weak field Zeeman, 
where 6Hz < dH;,, and thus we can treat the first two terms of the Hamiltonian as a single starting Hamiltonian A) 
and the Zeeman term is a perturbation. And the other is strong field Zeeman, where 6Hz >> 6H¢., in which we treat 
H©) + §Hz as the unperturbed Hamiltonian and the fine-structure term as a perturbation. 

Those two cases basically occur if B is very small or very large — let's understand where the cutoff lies. Recall 
that there's a typical magnitude in the spin-orbit coupling for the magnetic field due to proton motion (which we 
should think of as the internal magnetic field) — if the external magnetic field is much larger or much smaller than 
that internal field, it tells us that we're in the strong field or weak field Zeeman regime. There's also a case where 
the two are comparable, in which case we're in the intermediate field Zeeman regime (and then we need to consider 
the matrix elements of the combined perturbation 6H;, + 6Hz, and we need to do some hard work to diagonalize the 
matrix). 

We'll do the weak field Zeeman calculation here, and the strong field Zeeman calculation is left for our problem 


set. We're trying to understand the Hamiltonian 
i (Ho 4: dH) eh 


and the unperturbed states here of H(°) + 6M are labeled by |n, 2, /, mj) with known energies E'$(n,j). We must 
then do perturbation theory in 6Hz, and here we have a slight catch: we only have an approximate solution for the 
eigenstates and energies H(°) + dH;, (to leading order). But that will turn out to be good enough for our purposes, as 
long as we have the weak-field limit. 


Remembering that not all degeneracies were broken by the fine-structure, we're going to need to do degenerate 


NIB 


perturbation theory. Each degenerate subspace is labeled by integers (n,/), and within the subspace we have @ = j+ 
and mj € [-j,--- , J]. Thus, we need to calculate matrix elements of [5H] within each subspace, which means we 
need to find 

(n,£,j,m|5Hz|n, 2, J, mi). 


But we are actually pretty lucky — the off-diagonal entries vanish again! This is because 6Hz is proportional to L,+2S,, 
and this expression does commute with (2, so using the same argument as above, we need & and £' to be the same 
to get a nonzero matrix element. Similarly, we can check that J, commutes with dH,z, so mj = m, is also required. 


Therefore, we check again that the coupled basis is a good basis, and now we can calculate 


: . eB ; ; 
Ee = (n,£,j, m|6Hz|n, 2), mj) = ae (n,£,J, mlbz + 2S-,|n, ,J,mj) . 


Writing L, + 2S, as (Lz + S,)+ S, = Jz + Sz, the energy shift can be written as 


eB ; eB ; : 
Sire (n,£,J, mjlJjz|n, 25, mj) + ae (n, £,j, mj|Sz|n, J, Mj) . 
The first term is just £2 hm; (because our states are eigenvectors of J,), and in fact it turns out the second term is 


proportional to mj; as well with some extra constants! So each of these degenerate spaces split further into equidistant 
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lines, and let's see how that plays out. The trick is to use the projection lemma, which states that 


[Ji, Sj] = ihe ijn S~ => (S); = Oa 


where the j subscript indicates that we're in any eigenstate of J. So we find that (we only care about the z-direction, 


not that expectation of the whole vector) 


(n, 8, mj 


Z\n, &4, mj) hm 
rejG + 1) — «WG +1) 


(n, £,j, mj|Sz|n, £,j, mj) = (0,6, m)|S- Jn, £.i, mj). 
But now we can simplify the dot product: 


J=C+S5 => 5 


> as 1 Ls - 
-J=L-S4 S(t [? + $2) 


by using the same trick as we did for the spin-orbit term. But now our states are eigenstates for all of SF, (2, S2, Sie) 


plugging things back in, we find that 


(a, 2,5 AM Salt fy Jp) = oy 5 (i (j+1)-—e(2+1)4 3) 


and thus we've gotten our energy shifts for the weak field Zeeman effect: 


ij+1)= 004 1)+3 
EC) 2 = Frm, (14% pei) i). 


ZU +1) 


We often call the term in parentheses gj(2), and that gives us our final answer 


1),Z = 
ae = me wie) 


With this, all of the degeneracies of the hydrogen atom eigenstates are gone, because all of the multiplets split, and 


the splittings are different for different és. 
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We've been studying the fine structure of hydrogen for the past few lectures, and most recently we've been analyzing 
how the fine structure and Zeeman effect corrections compare to each other. Recall that the set up is as follows: we 
have the Bohr Hamiltonian Hgonr, and we also add two correction terms Hy, and Hz to that. And the reason this is 
interesting is that the different pieces of the Hamiltonian prefer different bases: for example, we can use the uncoupled 


basis |n, 2, me, m;) or the coupled basis |n, 2,j, m,;). (Here, because this is hydrogen and the spin of the electron is 5, 


we have j = £+ 3.) 

Either of these are equally good bases for the Bohr Hamiltonian, since the energy levels really only depend on the 
principal number n. But we do have preferred bases for the other two terms, and that makes the analysis a little more 
complicated. If we just think about the fine structure term by itself, recall that the correction to the Bohr energy 


Bohr — _ me*a* |Qoks like 


1 n 3 
fs _ 2,4 
SA wee nt (43 3) 


(in other words, we now get a dependence on / but not directly 2). And on the other hand, if we think about the 
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Zeeman Hamiltonian terms, which looks like 

Hz=—(ji. +ils)-B, jx=—-5—, fe=—-—. 
If we plug in the magnetic moments and carry out the dot product, we find that 

eB 
Hz = ome be =f 253); 
which has a simple operation in the uncoupled basis but not the coupled basis: 
eh 
Hz |n, 2, me, Ms) = ——B(im + 2m) |N, 2, Me, Ms) , 


2mc 


where the 5“ term is the Bohr magneton (approximately 5.79 x 10~%eV/Gauss). 


Fact 50 


The reason for these two different preferred bases is exactly why it’s difficult to treat these two effects as a 


single Hamiltonian perturbation (we would need to find another good basis). This is particularly notable in the 


intermediate-field case where the fine-structure and Zeeman terms are on the same order of magnitude. 


Remark 51. Because we're not probing the quanta of the electromagnetic field (unlike something like the photoelectric 
effect, or transitions between hydrogen energy levels), we don't need to do quantization, and we can treat the (classical) 


magnetic field indeed as a regular vector. 


We analyze the Zeeman effect by thinking about it in three regimes: weak (where the fine-structure Hamiltonian 
is much larger than the Zeeman Hamiltonian, meaning the good basis is |n, 2,/, mj)), strong (vice versa, and using 
the good basis |n, 2, mg, ms)), and intermediate (where we don’t know as much). In each of these three situations, 
we have a different “zeroth order Hamiltonian” (Bohr plus fine-structure for the weak case, Bohr plus Zeeman for the 


strong case, and just Bohr for the intermediate case). We got the formula 


jG+1)-e@+1) +3 
2f+1) 


ED) = (n.£,j,m|Hz|n.£.J,m) = weBmjgej), gel) = 14 


And remembering that = 2+ $, we can simplify this expression a lot to see what the two cases yield, and a lot of 
cancellation happens: it turns out that 


ion 
~ 2@4+ 15 


1 
j=lts = g(2) =14 


The most interesting case that we can talk about is the n = 2 energy level: recall that the H) = Hponr + Hee 


unperturbed Hamiltonian leaves some states degenerate (since it only cares about n and j), so we have 2Sj/2 and 


2P,/2 at one energy level (for a degeneracy of 2 + 2 = 4 because we can have mj; = £1/2 in each case) and 2P3/> at 
another energy level (degeneracy of 4 because mj; € {-3, —}, 5, 3}). And now we can think about what happens as 


we turn on our magnetic field B and see the Zeeman effect at the weak level: 
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Energy 


Lo ee 
aU) = : 
1 3/2 3/2 
f. S/he 
ara i 3 175 
1 3/2 =3/2 
g@ 1/2 1/2 
2Pi/2 i a2 172 
254)2 1 1/2 -1/2 
0 1/2 -1/2 
ai) = 2, 5 


weak-field Zeeman effect 


The top states split based on the value of m; (notice that ge(j) is constant for all four states there), and then the 
bottom states split based on ge(j) = 2 for 2=0 and g(/) = 5 for £= 1 because j = 3 and we want a negative sign. 
But as we increase the magnetic field, we can see that the differences in splitting will increase, and we need to start 
thinking about the intermediate regime (and toss this description of the energy splitting). 

On the other hand, the picture looks very different for the strong-field Zeeman case (as we're doing on our problem 


set). In this case, our unperturbed energy levels are 


13.6 eV 
FO = EBohr + Ez = 7 + Up B( me + 2ms), 


and then we need to compute the first-order energy shifts of the fine-structure Hamiltonian to understand the strong- 


field Zeeman effect. The answer turns out to be 
e) — mcza* (3 £(2 +1) — mm, 
fs 2n8 An 22+1/2)\(+1) J)’ 


where the curly brace basically means that this expression is bad for 2 = 0, and instead we use 1 for the 2 = 0 case. 


Now that we know this formula, we can start thinking about degeneracy, and one way we can also think about what's 
going on is to switch bases from the uncoupled to the coupled basis, where the constants in the change-of-basis 


transformation are given by (using a resolution of the identity). 
|n, 2, 1m, Ms) = S- [n, £,j, mj) (n, 2, J, mj|n, 2, me, Ms) . 
SM; 
The bra-ket terms here are called Clebsch-Gordan coefficients, and they basically give the probabilities of being in 


various states when we switch bases. Remember that if we have a fixed my and m,, we know that we must have 


mj; = m+ mz, (because J is the sum of L and S), and furthermore we know that / must be 2+ 5. So the sum above 


really only has two terms, and what we end up calculating for our energy shifts is 
(n, £, me, ms|H¢s|, 2, Me, Ms) = ye (n, £, me, ms|n, 2s’, mj) (n, £5", mj|Hgs|n, 2,4, mj) (n, fj, mln, 2, me, Ms) . 
JJ 


And now notice from the matrix element of H;,, and the other two bra-ket terms are basically giving us probabilities 
|2 


Px =| (0,2, me, mg|n, 2, J, mj) |- of being inj = 2+ 5: they are the squared Clebsch-Gordan coefficients. 


Next time, we'll build on this and fill in more about what's happening in the intermediate and strong regimes for 


the Zeeman effect! 
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13) March 10, 2021 (Recitation) 


We'll study the Zeeman effect a little bit more today. First of all, here’s a general schematic of what happens as we 
increase the strength of the magnetic field in the Zeeman effect (not perfectly to scale, and also with states labeled 


by |, mg, ms) in the limit, meaning that some change of basis is occurring in the middle): 


Energy 1 1,5) 
1,0,3) 
2P. 

3/2 ‘i 0, 1) 
1,1,-4) 
ites) 

2P1/2 0,0,-4) 
2841/2 8 i 
2 
iS) 
>B 


weak strong 


Basically, we can label things in the strong limit with the (2, mg, ms) numbers, and we are asked to do the classifi- 
cation of states in our problem set. The idea is that some of the states come together because there is degeneracy, 
and the number that really matters most is the value of myg+2m,. (Then the fine-structure term is a smaller splitting, 
but we need to work out what happens to each of those on our own.) 

Right now, we'll talk about what's going on in the middle of this picture, which is the intermediate-field regime. 
Because each of the Hamiltonians Hz and Hs are not good bases on their own, we'll just start with the |n, 2, /, mj) 
basis to work in, and we'll diagonalize the matrix that comes up in doing degenerate perturbation theory. The types 


of matrix elements we then need to think about will look like 
(n, 2,5", mi| Hp + Hz|n, £4, mj) . 


The fine-structure part of this is easy, because the fine-structure Hamiltonian likes this basis. But we still need to 
think about the matrix elements of the Zeeman Hamiltonian, and in principle this means we need to compute an 8 x 8 
matrix (for the n = 2 case) and diagonalize it to get the good states and the E“) energy shifts. (And we would then 
be able to reproduce the answers in the weak- and strong-field cases by taking B — 0 or B > ov.) 

We won't go through all of the details here, but we can at least understand why a lot of the matrix elements are 


Zero: 


Problem 52 


Take n = 2, and consider the change of basis that takes us from (n,£,j, mj) to (n,2, me, ms). What linear 


combination of the uncoupled basis states are each of the coupled basis states? 
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First, we can try writing out all eight of the coupled basis states: 


3 3 3 3 
212.8 Vda 
2, 35). 5 5) 


3.1 11 3 1 me 
21,2 = ane =) eee) OAL. =e 
2, 35). BS) LS 3) Ls: 5). 


- The first two states are easy, because mj = 3 implies that we must have mg = 1 and m, = s (there is no other 


way to get a total angular momentum large enough). Similarly, mj = -3 implies mg = —1 and ms = —3. So 


33 1 3° 3 1 
24, 51,1) ie) Sa 
| 53) .= pans). as. 5). pa =. 


(constant is 1 because of normalization, and also @ is conserved). 


+ For the next two states, 2= 0, so mg = 0, and thus m, = mj. Therefore, 


OG | ~|2,0,0,2 2,0,+ u = |2,0,0 : 
2'2 catia) erty ee Oeil ell 


This is nice, because it means that for these first four (simpler) states, the matrix for Hz looks the same in both 


bases, and thus the [6H] matrix's top 4 x 4 component is diagonal. 


- Next, notice that we can apply J_ to get from the first state |2, 1,3, 3). to the fifth state |2, 1,3, Ber and we 
know that J_ = L_ + S_ (for working in the uncoupled basis). Computing in the coupled basis, we have 


vs ome _ 5). = mii +1) = mm = 1) 2,1, 5. = 5). = m3 23. 5. = a) 


but computing in the uncoupled basis yields 


1 . 1 1 
= (Fs oe) a LA, 5) = hv2 a 1,6, 5) uc + nlp. 1-3, -5) ae 


uc 


so comparing the two expressions yields 


We can get similar answers for the other states: 


pu55) = 72:05) ue Ve fata, 5) 
eel ous) a Vic} eae ie) uc Pe) uc |r 


vad-}) = phd) apa) 
sad} = pena) pat) 


This is nice, because this means that the change-of-basis has blocks of size 1,1, 1,1, 2,2: matrix elements between 


coupled basis states only interact if there are overlapping uncoupled basis states in the change of basis. So the only 
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nonzero entries in this matrix look like 


(o) 

Or 1 
oO Oo 
OO "Ov "oO 
Oe Oh: Or. 3) 


[5H] = 


* 
* 


oO OO Oo oO 
oO OOO Oo 


* 
* 


> es es ee ee 
Oe Oe On On Os 
oOo 0 OO x 
oOo OO oO * 

* 

* 


* OO 


Diagonalizing this is now much easier (we only need to solve 2 x 2 problems), and that will give us a good basis. 
It turns out that the top four states have a term linear in the magnetic field 6, while the other four states have a 
more complicated functional form (because we need to solve a quadratic equation). So there are indeed four straight 
lines in the figure above: the top and bottom lines in each group of 4 correspond to the diagonal terms in the matrix 


above. 
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We're ready to leave perturbation theory and the hydrogen atom behind for a bit (but we'll come back to them later): 
it’s time to start a new topic, the semiclassical approximation, also known as the WKB approximation (credited 
to 1926 work by Wentzel, Kramers, and Brillouin). The idea is that we get approximations to solutions of differential 
equations with slowly varying coefficients, and in our applications to the Schrodinger equation, we'll see that classical 
behavior illuminates the quantum wavefunction behavior. 

To start, recall that for a free particle with momentum p, we associate a de Broglie wavelength A = (and 
this appears because the wavefunction of the particle w(x) = e/?*/" has period A). And classical physics is a good 
approximation to quantum physics when A is much smaller than relevant length scales in the problem. 

One formal way to access this classical regime is to take h + 0 (because then the de Broglie wavelength is always 
small). We might object to this strategy because Planck’s constant is a fixed number in our universe, but the point is 


that this is primarily a trick for studying the semiclassical regime. 


Example 53 
Suppose we have a particle that is moving in a potential V(x), and consider some energy E. We know that the 


energy of a particle is given classically by 


pe 
E = —+V(X), 
m 


We can rearrange this equation to find that 
p(X) = 2m(E — V(x), 


and this allows us to define a position-dependent de Broglie wavelength 


h Quah 


OD = BT = TER 
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If we now look again at the time-independent Schrodinger equation 
A 
—S_VPW(x) = (E-V(R))(X) > PVP = 2m(E— V(X) PCO), 
and now we can substitute in results from above: 
Pap = p?(X) v(x). 


The left-hand side has a square operator, while the right-hand side has a position-dependent function. And while this 
is a suggestive way to write the equation, it is not an eigenvalue solution (because p(X) is not a constant). 
Before we get into the WKB approximation, let's take a closer look at our particle wavefunction: we can always 


write a complex-valued function in terms of its magnitude and phase as 


UR, t) = Vole, tel", 
where p, S are real quantities. The reason for writing the wavefunction in this way is that p is then the probability 
density (because p(X, t) = |w?(x, t)|*), and we know that the probability density satisfies a continuity equation 


=> 


ale ae) 


where J = Aim(v*Vy) is the probability current. And in this current form of ~, we can write (using the product 


rule) 


=  1V0 isin, WP oo, i8/n 
Vw = 5 7a T hi VSe i 
and then multiplying this by wW* cancels out the exponential term to get 


1 ips 


But we want the imaginary part of this expression to get to the probability current, and because p,S are real, we can 


substitute in to find that 


In other words, surfaces of constant phase S have probability current perpendicular to those surfaces. 


Fact 54 


The reason of recalling this probability current story is that there is an analogy here with fluid flow, if we think of 


the probability as the density of a classical fluid. 


We know that the current for a classical fluid looks like 


so classically the gradient VS should (maybe) look like our position-dependent momentum p(X). (And that is indeed 


what we'll find as we proceed on! 
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Example 55 


Let’s check this logic in the case for a free particle 


We: t) = ex (# 


Indeed, we have 
S=p-X=Et = VS=f= pix), 


so our logic above at least checks out for this special case. 


To start setting the WKB approximation now, we will first write 


UK 1) = Vote ee = eS 


where we define a new complex number ; 
i 
S(x) =S- 5: log p. 


(Remember that because S is real and so is p, the first term on the right-hand side is real and the second is imaginary. ) 
We'll be limiting ourselves to one-dimensional physics, and we'll also be thinking about solutions to the time-independent 
Schrodinger equation (so there is no t-dependence). We now want to plug this e/S(/" into our Schrodinger equation 
p°w = p?(X)w(x), which can be rewritten as 

2d 


—fi 
dx2 


= p*(x)V(x), 


where remember that p(x) is the number 2m(E — V(x)). To plug this in, we first compute the derivatives 


y= is! iS/M apt — iS? elS/n 4 (is)? gia/h = 6" _ ins"yelS/M 


A A A 


and then we can plug this in and simplify common terms on both sides to get 


S'(x)? — ifS"(x) = p?(x) |. 


(At this point, we still have not made any approximations!) And the point is that we've converted the linear differential 
equation for w into a nonlinear differential equation for S. It might seem like we've made our lives harder, but the 
point is that this is a good starting point for the semiclassical approximation because of the factor of f (which will be 
used as an expansion parameter). 

As we said at the beginning, we'll take V(x) to be a slowly varying potential (on the scale of the de Broglie 


wavelength A). 


Proposition 56 


The /fS” term in the above equation is small in the limit where V(x) is slowly varying. 


To get some intuition for this, first suppose that V(x) = Vo is constant. Then p?(x) = 2m(E—V) is also constant, 
so we can define po = \/2m(E — Vo), and we know that the solution to the Schrodinger equation is 


w = e@lPox/h els/h S Dox. 


And indeed in this case, S’’ = 0, so it’s reasonable to believe that small variations will also lead to small values of S”. 


(Or alternatively, we can take f + 0 to make that term small as well.) 
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So now we're ready to do a power series expansion: much like in perturbation theory, we will write 
S(x) = So(x) + AS1(x) + f7So(x) ++, 
and we can plug this into the equation S’* — if'S” = p*(x), and we find (really only to first order) that 
(Sh + HS) + O(h?))* — iA(SY + HSY + O(F?)) = p2(x). 
Expanding this out and grouping in order in fi yields 
(So)? + A(2S9S} — 1Sg) + O(F?) = p(x). 


Here, we should be thinking of p?(x) as a classical quantity of order 1 = f°, and now we can solve the equation 
order-by-order in fh. To order f°, h?, we find that 


1S! 
SB =x). S.= Zen. 


respectively. The first of these equations has solution 


WH 

o~ 
II 
im 


ep(x) => | Sox) =+ / * dx'p(x) | 


‘0 


where we should remember that p(x’) = \/2m(E — V(x’). 


Remark 57. We know that the wavefunction in quantum mechanics is continuous, and in fact if the potential is 
smooth, p is infinitely differentiable. So we won't switch between the two signs in this integral for So(x). Also, we've 


combined the integration constant into the offset xo (so we can think of that as shifting the integration constant). 


And now that we have the solution to So(x), we can find the next solution nicely: 


/ +p'(x) = id 
2+p(x) 2dx 


Si = 


log p(x) = > | Si(x) = , log p(x) + C’}. 


So now we have an approximation to the wavefunction: 


‘ ; x / 1 
W(x) & elSCO/M — @i(SoOI+RS1C))/" — exp [ (+f dxé PO) — 5 log p(x) + c) 
xX 


‘0 


and now we can pull out the log from the exponential and the constant C’ (turning it into a constant A in front) to 


get 


W(x) & F355 exp 5 ‘s ax’) 


This is our WKB approximation, and we can notice a few things about it now. Because we like to write w(x) = 
y/p(x)eS9/ in terms of an amplitude and a phase, we can find that the amplitude is 


ele alah 
~ p(x) v(x)’ 


p(x) 


In other words, the probability density is large where the particle's classical velocity is small, which makes sense — 


particles linger in places where velocity is small. And the phase is 


s=4/ dx’ p(x’), 
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which tells us that (by the fundamental theorem of calculus) 


= VS Al? 1 Al? 
= CAE allan eee ot al 


m p(x) m m 


Notably, this probability current is a constant, and that's indeed necessary because the Schrodinger equation that 
we're solving is time-independent (so the probability density is time-independent, meaning that oe +0,J3=0 => 


J is constant). 


Fact 58 
We can think of the + solution in the + above (with J > 0) as a particle moving to the right, and the — solution 


(with J < 0) as a particle moving to the left. 


But we've been ignoring some of the details here, which we'll address now. First of all, we've been implicitly saying 
that E > V(x) (placing us in the classically allowed region), so that p(x) = ./2m(E — V(x)) is a real number. 
(After all, a classical particle cannot ever be at positions where V(x) > E.) But we'll treat the other region later. 

Often we will use the shorthand p(x) = fik(x), so that our solutions are of the form (renormalizing the factor of 
Vf through A) 


W(x) = A et! Se dx! k(x!) 


\/ k(x) 


But we know that two solutions to the time-independent Schrodinger equation with the same energy yield a general 


solution which is a linear combination of those two: 


A i [* dx'k(x’) B =i [* dx'k(x') 
W(x) = ce! ba HO) 4, Fete 
V K(x) V K(x) 
But in the classically forbidden region where p(x) is imaginary (because F < V(x)), we can take a shortcut and 
“analytically continue” what we had from above: we use p(x) = /hk(x), where K(x) = a se) is a positive 


constant. We then find the general solution 


C abs dx'k(x') D a Is dx'k(x’) 
V K(x) V K(x) 


(noticing that the / in the exponents are gone). And that means we get real exponentials instead of oscillating 


V(x) = 


exponentials — the first term here is exponentially growing for large x, while the second term is exponentially decaying. 
We should now be more formal about when this approximation is valid — in other words, how slowly does the 
potential V need to vanish? Recall that we derived these wavefunctions by formally expanding in /f, which means that 


we need the order fi! terms to be smaller than the order f° term. 


Example 59 


For illustration, we can compare Sj (one of the fi! terms) to p?(x) (one of the f° terms). 


The condition that needs to be satisfied is then 
nISh(x)| K p?(x) 


for all real numbers x where the approximation is being applied. We know from previous work that |56| = |p(x)|, and 


|S¢| = |p’(x)|, so the condition that needs to be satisfied is 


p'(x) 
p?(x) 


hilp’(x)| < p(x) => nl <1. 


_|d 
7 ar300 
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And remembering that the position-dependent de Broglie wavelength is aa our condition Is really that 


d(A(x)) 
eon 


for our approximation to be valid. (And indeed, the left-hand side here is dimensionless, so everything makes sense!) 


We can alternatively get intuition for this by multiplying both sides by A(x) to find that 
dr 
A— r 
| | «A 


and the left-hand side is basically the change in the de Broglie wavelength over one de Broglie wavelength. So if we 
are in the classically allowed region (for simplicity), and our wavefunction is oscillating but the period of oscillation is 
changing as a function of x, we need the period of oscillation to not change too much over a few wavelengths (so 
that the local period can be defined). 

However, there is one generic place where this kind of reasoning breaks down: when we look at the classical turning 
points (meaning that V(x) crosses from being below energy E to above energy E£), we always have p*(x) — 0. More 


precisely, we can approximate V(x) to be approximately linear around the turning point x = a, meaning that 
Vik}= a =9lx=2) 


for some g > 0. If we assume that x is still within the classical region, then 


27h Quah a 


P= ome =Vioj) =2 — x 
p? = mE — V(x)) = 2mg(a—x) = Nx) = Ty = 
Then we can look at 
dX| 1 2mh 1 
dx| 2./2mg (a— x)3/2' 


and clearly as x — a, we cannot have || <_ 1 because of divergence! So no matter how smooth our potential looks, 
we will always run into trouble with this semiclassical solution. To get around this, we know that we have a semiclassical 
solution in both the classically allowed and classically forbidden region (sufficiently far away from the turning point), 
and we'll stitch those together by solving the Schrodinger equation exactly near the turning point. Those connection 


formulas will help us connect the different solutions together! 


15 March 15, 2021 (Recitation) 


We'll start with a bit of review about the WKB approximation, giving some hints about where we're going next. Much 
like perturbation theory, this is an approximation for the Schrodinger equation, but this time the approximation has 
to do with how slowly the de Broglie wavelength is varying with position. Recall that if we write our wavefunction as 
p(x) = ef$0/", then we get the differential equation 


(S‘)? — ins" = p*(x), 


where p*(x) = 2m(E — V(x)). Then we expanded out this expression as a power series in fh, and we found the zeroth 
and first-order terms of the approximate solution. The result was that in the classically allowed regime E > V(x), we 


have 
W(x) _ A atte dx'k(x') # B ald dene) 


\/ K(x) \/ K(x) 
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where K(x) = a) = ¢\/2m(E — V(x)). (These two terms correspond to a left-moving and a right-moving particle, 


respectively.) But we also have the classical forbidden region, in which we get 


a I dx'k(x') Ee D wiley dene): 


Cc 
\/ K(x) \/ K(X) 


where this time K(x) = Be) = ;\/2m(V(x) — E). The four coefficients A, B, C, D are just generic numbers (since we 


can have linear combinations of our two original solutions to the differential equation). 


W(x) = 


And to tell whether this method is valid, we can check whether the second-order fi term is small compared to the 
ACD) < 1, where 


first two terms, or some other comparison of this type. And we find that the requirement is for eo 


(x) = eg) is the local de Broglie wavelength. Writing out the derivative, we have that || a | na 


— 2 
= Gs SO 


another way to state our assumption is that 
dp 
A(x) —] « |p(x)]- 
poage| < loco 


In other words, we can talk about WKB's validity in terms of changes to the local momentum p(x) over a distance of 
order \. We can also look more carefully at the equation p? = 2m(E — V(x)) and take an x-derivative on both sides: 


we find 


So we can write things in terms of the “force” now: 
dV 1 h 
A(x) —]} = — JA(x)pp’| = — Ip’, 
roa ge | == cope = 20 
and then we can substitute this into the above expression: because |hp’| < |p?|, what we find is that 


dV p? 
2m 


(where we added a factor of 2 to make it look more familiar). 


Proposition 60 


The equation |A(x) | < ee says that over a distance of A, the change in potential energy should be much less 


than the kinetic energy. 


So now that we have these two approximations: as mentioned at the end of lecture, we should try a WKB-type 
approximation for E > V and also E < V, but then near E = V we need something to stitch those two solutions 
together. That's because E = V implies p + 0, and that means all of the approximations we've described above break 
down. So we need to figure out the connection formulas, and we'll do that in lecture. 

But we'll write down the answers right now so we've at least seen them: suppose that E = V at x = a (and we 
have exponentially decaying solutions for E < V and oscillatory for E > V). Suppose that our solution on the left is 
oscillatory, and the solution on the right is exponentially decaying. Then we find that the coefficients A, B,C, D are 


constrained by the wavefunctions 


a6 cos ([ K(x')dx’ — 4 = TS exp (- [ K() dx) 


(in other words, if we know the form of the solution on the right, we know what the coefficients have to be on the 


left). Notice that the equation above relates C to a combination of A and B if D = 0. On the other hand, we also 


50 


have that 


Ea" (| Keeyax' - 7) => say ( ff eo02»’) 


(which relates D to a combination of A and B). And the arrows are important — we can’t apply the formulas in reverse, 
because an exponentially growing solution doesn't give us enough information to also learn about the exponentially 


falling solution (approximations break down). 


Example 61 


One of the applications of WKB is the tunneling phenomenon — we also need the connection formulas for a 


derivation here, but we'll just explain the answer right now. 


Basically, a particle of energy E can get over a hump of energy larger than E, and the WKB approximation tells 


us that the transmission coefficient (ratios of probability currents out ) is 


b 
T © exp (-2 H(t) dx) 


where the hump in V(x) goes from x = a to x = b. (Imagine that there is an infinite potential for x < 0, so this 
means that usually the particle is stuck in the left region from 0 to a.) And because « is proportional to J/V(x) —E, 
indeed making the potential larger makes the exponential smaller, which gives us a smaller probability of tunneling. 

If we know this T, we can then find the probability that our particle does not tunnel (that is, it remains trapped in 


the given region from x = 0 to a). If we know that the particle is trapped at some time t, then 
P(t+ dt) = P(t)(1—wdt), 


where Ww = ty is the tunneling probability per unit time. Here At is some characteristic time, which we can think of 


classically as “how long it takes for the particle to bounce off the potential at x = a,” which motivates the definition 


b / 
ar=2/ se 
a V(x’) 
P(x) 


where v(x) = =<*. And now rearranging, we find that 


P(t + dt) — P(t) _ 


aS dt 


wP(t), 


and thus P(t) = Po exp (—wt) = Py exp (—4) (the probability of not tunneling is exponentially decaying). Here, T = 4 


W 


is often called the lifetime of the particle in the well. 


Example 62 (Gamow’s a-decay) 


In nuclear physics, a nucleus can decay via N + N’ +a, where the a-particle has two protons and two neutrons 


(so N has two more protons than N’). Gamow’s model accounts for the instability by thinking of a nucleus as 


having a potential well. 


If N’ has Z protons, then the potential takes the form 


—VWY O<r<n 


r>n 


where we basically have a Coulomb repulsion beyond x > m. To explain this, note that two protons and two neutrons 


51 


almost form Helium-4 (without the electrons) and thus it is very stable. So once an alpha particle forms, there are two 
contributions to the potential — there is a Coulomb repulsion for x > r, and a strong nuclear binding force for x < ry. 


2 
2Ze* __ EE 


So if our particle is at some energy E > 0, so that we need to tunnel through the range [r, fm], where = 


Then we can use the above calculation to find the tunneling probability: we find that T = e~?7 for 


1”? 2Ze? V2mE f? / 
Ady r h 7 r 


This integral is some arcsine term, but the limit for large r2 >> rm, is that we have 


FE ae) 


Since fo ~ ‘ this means that asymptotically, we have y ~ FE: And for various different nuclei, we have different 
lifetimes, and indeed exactly like in this model, we can plot tunneling probabilities versus E. It turns out that this 
simple model does get the scaling model correctly! (And notice that even when ¥ changes by a little, the tunneling 
probability can change drastically, and thus we see the tunneling probability change significantly — order of 107° — 


across different nuclei.) 


Remark 63. The effect of the —Vo part does show up, but only as a prefactor in w = a — it doesn't change the 


exponential behavior as a factor of E. 


16 March 16, 2021 


Last time, we discussed the semi-classical or WKB approximation, which works well for slowly varying potentials V(x). 
There are two main regimes — in the classically allowed regime E > V(x), we have oscillating behavior, and in the 
classically forbidden regime E < V(x), we get exponentially growing or decaying solutions. And we expect this WKB 
approximation to be good when | | < 1, where A = oy is the position-dependent de Broglie wavelength. But no 
matter how smooth our potential is, that condition will break down near the classical turning point E = V(x), so we'll 
discuss how to deal with that case today. 

Suppose that our classical turning point occurs at some x = a, and our potential is such that V(x) > E at x > a. 


Then for large enough x >> a, we have 


w(x) = .ao exp (- a ax’(x') + Ja exp ([ ax’(x') 


and for small enough x < a in the region V(x) < E, we have 


aa Of dx'K(x’) — *) + TS sin ([ dx’ k(x") — ) 


(we've converted the exponentials to sines and cosines, and we can add the phase to make our lives easier later on). 


¥(x) = 


It may look like there are 4 independent coefficients here, but the Schrodinger equation is a second-order differential 
equation, so there should only be two unknown constants. And indeed, if we aren't making any approximations, we 
should be able to go from the solution on the left to the solution on the right (at least in principle), so we should be 


able to write C and D in terms of A and B, or vice versa. 
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Theorem 64 (Connection formula) 


There is a relation between the coefficients of w above given by 


G=2A,) =——B. 


We should note that if we plug in x = a into the WKB solutions with those coefficients C = 2A, D = —B, we find 


Wwee(X = a) F Wwwe(x = a*); 


in fact, the WKB solutions will blow up. We would have continuity of the wavefunction and its derivative if we had an 
exact solution, but that’s not what we havel 

Additionally, there are some important caveats that we need to keep in mind when using the formulas, but we'll 
talk about that later. 


Example 65 
Suppose that we have a potential V(x) such that V(x) = co for all x < 0 (so there is an infinite hard wall that 


cannot be passed). We are looking for solutions to the Schrodinger equation with E < V(oo) (these are called 


bound states). 


The boundary conditions we have here are that #(0) = 0 and (co) = 0 (otherwise we won't have a normalizable 
state). Let x = a be the classical turning point (so that V(a) = &), and for simplicity let’s say that V is increasing 
(and smooth) for x > 0 Then the WKB approximation in the x >> a region tells us that 


W(x) = 


eS exp ( [ ax’(x') 


(we can only have the decaying exponential term here because the other term would make the wavefunction not 


normalizable), and B = 0. We'll now use the connection formulas to go across x = a into the classically allowed 


region: we learn that D = 0 (so the sine term above is not present), and 


2A 2 T 
x) = —— cos dx'K(x' -3) 
W(x) = cos ( [dx'e(o) — 5 
And now if we implement the boundary condition =(0) = 0, we find that 


cos ([ awnw’)- *) =0 => [oxtuee) =n (n+) ; 


But here, k(x) = #./2m(E — V(x)) is positive, meaning we must have n = 0, 1, 2,---. And thus we have characterized 


the quantization condition for our bound states: 


1 a 
5 f dx./2m(E — V(x) =n (043) , NEZsol. 
h 0 4 ~ 


In other words, we indeed find that only discrete energy levels are allowed! And this calculation can in fact be generalized 
to smooth potentials V(x) where E < V(co), V(—oo) (so that the particle is classically constrained to some region 


[a, b], where a, b are the turning points). Then we analogously find that 


i 
/ arc K(x!) = 0 (n+ 5): n€ Zso\. 
a 


53 


Proof of Theorem 64. To get around the fact that WKB only applies far away from the turning point, we will solve 
the Schrodinger equation exactly in the neighborhood of the x = a turning point and then match coefficients from 
there. The idea is that if we're sufficiently close to x = a, we can approximate V by a linear potential, and there will 
be an area of overlap between our methods where the linearized V is valid, but so is the WKB approximation. 

Explicitly, suppose that V(x) — E = g(x — a), where g is the slope of the potential (the derivative V’(a). We know 
that our WKB solutions look like (if g is positive) 


Wr(x) = 555 exp (- a K(o) x’) + Tea exp ([ K() dx) 


Now turning to the region where the linearized potential is valid, note that 


(x) = (v(x) — £) = (x — a), 
so that 
i: K(x')dx’ = yom dx'(x’ — a)? = amd = (x aay, 
If we let 7 = Cre)" and introduce the dimensionless variable u = n(x — a), then our integral above becomes 4 2 13/2 


and «?(x) = 7?u. And thus, our solution that is valid for both the linearized potential and the WKB jfidens ase 


2 3/9 | 1 B 2/2 
a0) = again 2° (—30°") + igs (52°). 


On the other hand, the solution on the left is 


W(x) = reayes( ff ame - §) + aessn ([ ann), 


and we know that k(x) = #./2m(E — V(x)), so employing a similar strategy with the linearized potential yields 


2, 3/72 7), D _ (2, 3; 7 
W(x) = Tale ape 05 (3 |u| 4 aula 5" glul ae 


And we can now stitch these together with the Schrodinger equation: 


solution is 


te d?w d*y 2m 
SE + V(x)o(x) = EW) > SE = SEV (x) - EY. 
Plugging in the linearized form of the potential yields 
ay a 
r= (x W(X), 


and we can rewrite this equation in terms of u to remove the constants: we find that the left and right sides become 


(evaluating w at u instead of x) 


dy 7 dy oe 
dx2 du2’ 


(x — a)p(x) = nruy, 
and thus our solution simplifies to the Airy equation 


d2 
ay = u(u). 


du? 


This is a second-order linear differential equation with two linearly independent solutions, which are denoted Ai(u) 


and Bi(u). (There are certain power series and integral representations for these functions which are good for us to 
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know.) But from there, we can derive the asymptotic behavior of the two functions, and we want to match a linear 
combination of Ai and Bi with the WKB approximations, so we'll analyze those solutions at large u. It turns out that 


for u> 1, 


i. 9 a sn, 
Ailu) ~ sea on ( 3 ). 


and for u < —1, 


; 1 1 2 T 
Ailu) ~ org os (Zlul?” *). 


But this has the same form as our WKB solutions — we have the same exponential term and the same u?/* in the 
denominator! And there is no ambiguity in these normalizations — the ratio of the coefficients in the positive and 


negative u regions for Ai will stay constant no matter what normalization factor we put in front of it — the coefficient 


of the cosine is twice the coefficient of the decaying exponential term, which indeed means | C = 2A] as desired. And 


if we repeat this argument for Bi, we find that for u > 1, 


and for u < —1, 


so indeed the ratio here tells us that | D = —B |, finishing our derivation. 


Fact 66 


To understand why this intersecting region between the linearized V(x) and WKB solutions does exist, we need 


to understand whether the deviation from linearity happens early or later than the u > 1 limit for Airy. 


We know that i 
V(x) -E=g(x-a)+ a" (ax ay, 


where g = V’(a), and thus in the intersecting region, we need to make sure the next-order correction is small compared 


to the linear term: 
V"(a) 


(x-a) <1. 
Rewriting this in terms of u = n(x — a), our condition becomes 


2mg* | _ (2mv'(a)* | 
fi2 V"(a) fi2 V"(a) : 


V"(a) u gn 
1 — 
97 1S via) 


The other condition for our matching region is that we must have u > 1 (so that the Airy function asymptotics are 


valid). Therefore, the intersection region must satisfy 


(ne 2mV'(a)*\ "34 
he V"(a)' 
mV'(a)* ae 
which means that the condition for the connection formula to be valid is basically for | V’(a) << (a) (in 


other words, the second derivative is small with respect to the first derivative). And indeed, this condition is always 


satisfied when we take fi > 0. 
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In summary, we can write down a relation like 


ae cos ca k(x’) dx’ r) ao exp ( [ ax’(x') 


a ( [ky - 3) => Tas ([ ae). 


But now notice the arrows: the caveat here is that all of our solutions are still approximate, and thus there are 


subleading terms in each of the solutions (in the expansion parameter a). And when we have both a decaying and 
a growing exponential, the decaying term is very small compared to the growing exponential, so in particular the 
subleading terms from the growing exponential will be larger than the leading terms from the decaying one! 

So if we start in the classically forbidden region, and we know that there is only a decaying exponential, then it is 
valid to use the connection formula and turn it into a cosine for the classically allowed region. But if we try to go the 
other way and we only have the cosine solution, it's still possible that subleading terms gives us a small amount of the 
sine solution, and that will give a growing exponential which will always overwhelm the decaying exponential term! So 
we can't use the arrow in reverse. 

A similar argument works in reverse: the second arrow works in the forward direction, because a small cosine will 
be harmless when we start with a sine term in the classically allowed region. But it’s possible that if we start with 
a growing exponential term, continuing to the classically allowed region could introduce a cosine term of the same 
magnitude (because the exponentially decaying term could be hiding with the same coefficient and still be of lower 
order). 

We've been talking so far only about the case where the slope g = V’(a) is positive, and indeed similar derivations 


can be made for g < 0 (the formulas look very similar): 


— (- [ ax's(x)) = = Cos ([ k(x‘)dx’ — 7) 


7 . - (| ax'stx)) os re oh k(x')dx' — 7) 


Example 67 (Quantum tunneling) 


Suppose that we have a potential barrier V(x), and we shoot particles at the barrier with some energy E. We 


want to know whether particles are able to make it through the barrier, even when E is less than the height of 


the barrier. 


We may have discussed this general type of scattering process in previous classes, and in principle this is a time- 
dependent problem (we have a wave packet going in and a wave packet coming out). But luckily, the time-dependent 
problem can be reduced to the time-independent problem with particular boundary conditions, and we'll see more of 
this later in 8.06. 


At least for this one-dimensional problem, the boundary conditions are simple: if V(x) — 0 as x — -too, then 


ikx 


we send an incident wave Ae’** into the potential barrier, and we have a reflected wave Be~'** and a transmitted 


wave Fe'**. We can then solve the Schrodinger equation with the boundary conditions a(x) > Fe!kX as x > 00, 
. 2 
and w(x) > Ae’ + Be7'* as x + —oo. Then we can extract the transmission probability via T = (at) , and the 


2 
reflection probability is R = (at) . It’s a fact that these coefficients satisfy T + R = 1, and we're going to find 


approximations for T and R using the WKB method next time! 
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17 March 17, 2021 (Recitation) 


We'll focus some more on WKB today, particularly understanding how to use it to find quantization conditions for 
energy levels. Recall that the connection formulas can be used when searching for bound states, and our general 
strategy is to find the two points x = a, b where V(x) = E (assuming that V(x) > E outside of the region [a, 5]). 
Then we need to have exponentially decaying tails to have a normalizable state, and we found in lecture when we had 


an infinite wall at x = a that we get an expression like 


cos ([ k(x')dx’ — 4 


in the classically allowed value, and because we must have (a) = 0 we know that the argument of the cosine must 


: T 1 
_ 1 Pon ie ca 
a= | k(x‘) dx m1 G 5) T, 


where n € Z5o. (After all, the integral ie k(x')dx’ is a function of energy, both in its integrand and in the bounds of 


U(x) = 


take on one of a discrete set of values: 


integration.) 


Proposition 68 


One useful shift of the integral that we can make here (to change whether we’re integrating from a or from b) is 


a k(x')dx' = ie k(x’) dx’ -[ k(x’) dx’. 


With this, we find that the wavefunction wW is also proportional to cos (A — i. dx'k(x’)). Then we can use the 


cosine difference formula, noting that cos A = 0 when we have a bound state, and thus 
2A 


y= rasinasin ([ ar’ k(x) =+ aes sin ([ ax’ k(x!) . 


And this way, it’s built into the wavefunction that w(0) = 0, and also if we have a problem where we care about the 


nth excited state, we know that excited states will have more and more nodes in their wavefunctions. And it turns out 
that this formula gives us the right number of nodes in the nth excited state, because bis dx'k(x’) crosses multiples 7 


the right number of times! 


Proposition 69 
We've mentioned before that WKB is a large-n approximation formula, and we can understand that by looking at 


the condition : 
p(x) 
2m ~ 


dV 
poog < 


Here, we can get a rough estimate for p because k and p are very closely related: if p is slowly varying, and we 


i re 3 
a) ax’ p(x!) = (n+ 5) m 


then we can estimate that p should be of order pan And indeed, plugging this back into our condition puts an n 


know that the integral is equal to 


2 


term on the right side, and thus we can eventually make our approximation good for a fixed potential V(x). 


57 


Example 70 

Suppose that we have a double-well potential, where there is a quadratic potential centered at x = a for positive 
xX and also another one centered at x = —a for negative x: this can be encoded as 

a 


mu > 
= (Ix| - a), 


VC) 


Notice that V(0) = — We want to think about the case where a is large (relative to the quantum scales), 
so that our well is deep and also steep. Specifically, we'll take a >> Lo = a the length scale for the quantum 
harmonic oscillator. 

We have a parity symmetry here — we can take x +» —x and get the same V(x), and the kinetic energy & also 
stays the same, meaning that the whole Hamiltonian stays the same. In other words, there is a parity operator P 
such that [H, P] = 0 and P|x) = |—x). We can thus specify quantum numbers of the P operator simultaneously 
with energy eigenstates, and thus we will divide up the wavefunctions into even and odd parities: the even states have 
W(x) = Wn(—x) (parity of +1), and the odd states have w,(x) = —W,(—x) (parity of —1). And because we know 
that the jth excited state has / nodes, we must have the ground state be even, the first excited state be odd, the 


second excited state be even, and so on. Let’s call the ground state w+ and the first excited state w_. 


Remark 71. /n particular, it’s important what happens at x = 0: even states will not have w,(0) = 0, but (it turns 
out) they will have an exponentially small magnitude at 0, and the derivative at 0 will be 0. And the reverse is true for 
odd states, where p_(0) =0 but w_(0) =0. 


A good zeroth order approximation for this problem is to say that the well in the middle is very tall, so we can just 


think about the single-well wavefunctions. If we let Wo(x) = Tah exp (-23) be the ground-state wavefunction for 
ie) 


the simple harmonic oscillator, then we can form even and odd combinations of this: perhaps we have 


_ Wo(x — a) + Yo(x + a) 
= Pp 


If this were true, then the energies of the first and second excited states are the same, so what really causes the 


W(x) 


energy difference is the potential at x = 0. If the ground state and first excited state energies are E, = E — AE 
and E_. = E+ AE, it turns out that AE is exponentially small for a >> Lo, and that’s the quantity we'll be trying to 
compute right now. 


We can show that for general symmetric potentials 


ir j 
AE = —p+(0)w_ (0) 


(we can see Professor Zwiebach’s textbook for this). We'll try doing this with WKB now: suppose that we're looking at 
some energy E which is smaller than V(0), so that V(x) = E occurs at some points x = x1, X2 and also x = —x1, —Xo. 
We'll solve everything for just the positive x case by symmetry. 

The regions (1) x < x, and (3) x > xo are forbidden, while the region (2) x1 < a < Xo Is allowed. We'll start in 


the far-away region x >> x2: WKB tells us that the solution must take on the form 


da) = Tas oo (- f j ax’n(x')) 


and there is no growing exponential for normalizability reasons. From this, we can infer from the connection formula 
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that we must have a cosine wavefunction in the allowed region 


a" Gn dx’k(x’) — ) 


We also need to do something at the other classical turning point, so we'll use the trick from above to introduce a 
new variable @ = ii dx'k(x’), so that 


Wey (x) = Te cos ¢ = [ dx'k(x’) *) = aa ¢ [ dx'k(x') + 4 


and it turns out that it’s convenient to split this up with the sine difference formula as 


= TS sind cos ( [mee - ) — cos @sin ([ ae'ax’)- *)| . 


And now if we look at region (1), namely the x where 0 < x < x; (so that WKB is valid), it’s reasonable to form the 


tee) = Pr [no (f" dente) an (— fae). 


and writing the integral from 0 to x in terms of the integral from x to a via 


Wa (x) = 


linear combinations 


e = dx'k(x’) = [dst =$ - RO: 


Then we can plug those in and use the connection formula, and the cheat that we can do in this problem is to not 
worry about the arrow: usually we can only go one way with the connection formula, but here we're allowing ourselves 


to replace both exponentials with cosines and sines. We find that 


Vo)(x) = we [act cos ([ dx'k(x!) — 4 $e F/2 sin ([ dx'k(x’) — *)| 


And once we've done the connection in both directions, we can compare the two formulas for W2) and match up @s 
and ¢s: that gives us the desired formula 4Asin@ = 2De®/? and 4Acos@ = +De~®/2, and the ratio of these tells 


us that tan@ = +2e%, which is the desired quantization condition. (And in the limit of large ¢, we basically know to 
expand around @ = 5 to get the answer — it turns out that E = fw 9g, and thus AE = fe e?.) 


18 March 18, 2021 


Last time, we talked about the WKB connection formulas, which allow us to stitch solutions in the classically allowed 
and forbidden regions together (but only in certain directions). We explored some applications of this method to bound 


states of potentials, and we'll continue our discussion of tunneling today. 


Example 72 
Recall that we're assuming that we're considering a system where we have a potential V(x) with peak higher than 


the particle's energy E, and we're seeing whether a particle can go through it. (We assume that V(x) > 0 as 


26 ae oo.) 


We said last time that to find such a tunneling probability, it suffices to solve the time-independent Schrodinger 
equation with certain scattering boundary conditions: we must have w(x) — Fe'k* as x + oo, and we must 


have w(x) = Aelkx + Be7'kx as x + —o0. (Here, A, B, F correspond to the indicent, reflected, and transmitted 
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waves, respectively.) If we can then solve the time-independent Schrodinger equation with energy E, the transmission 
probability (also transmission coefficient) is then ler and the reflection probability is |B)? (it is a theorem that we 
will find 7 +R=1). 

Before we apply WKB to this problem, we can notice a few things: if the barrier V(x) is tall and wide, then T <1 
(the transmission probability is very small). And in this case, we expect that the solution mostly exponentially decays 
in the classically forbidden region (it would be strange if the wavefunction started to grow). 

So now applying WKB, let’s call the turning points into and out of the classically forbidden regions x = a and 


x = b (for a < b). We'll work right to left: in the range x >> b, sufficiently far away from the turning point, we'll have 


es exp («f° k(x’) dx’ — 7) : 


(The point of the m phase is just to make the connection formulas easier later on.) In principle, there should also be a 


V(x) = 


—i fe k(x') dx’ term, but remember that we have a boundary condition where there is only a right-moving wave! And we 


can to check that our wave is right-moving correctly in this case, because V(x) + 0 as x > 00, so k(x) = eS 
goes to veme =k. Therefore, 


xX 
i; k(x')dx’ 3 kx +C 
b 
for some constant C as x > ov, and therefore we have 


p(x) = toelket phase) 


Vk 
which is of the desired form (we can incorporate the phase into the amplitude, and it doesn’t matter because we care 
about the magnitude). So now by WKB, we can connect this to the solution in the barrier region [a, b], but first we 


rewrite the complex exponential as a sum of cosines and sines 


w(x) = a" ([ k(x')dx' — 7) + aa ie k(x')dx! — ) 


and the connection formulas (for a downward-sloping potential) tell us that a cosine becomes a decaying exponential 
(with a factor of 5), and a sine becomes a growing exponential (with a factor of —1). If we apply those formulas 


blindly, we find that in the range a< x < b (sufficiently far away from the turning points), we have 


W(x) & as” (- a K(o) dx) _ a exp ([ K(o) x’) 


But now we can talk about the arrows and ask whether we are justified in keeping both pieces here. Notice that as 


X gets sufficiently far away from the classical turning point b, the growing exponential will overwhelm the decaying 
exponential (so the first term is much smaller than the second). 

So it makes sense to just drop the first term, and that’s indeed what the connection formula tells us: we can 
legitimately keep the growing exponential when we apply the connection formula in this direction, but not the decaying 
exponential. (If we kept the first term, we wouldn't get into real trouble, but we would be claiming higher accuracy 


than we actually have.) That means we just have the second term 


W(x) & sas ([ x’) 


in the range a << x < b, and now if we use the connection formulas for an upward-sloping potential, we need our 
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integral to be based at a instead of b. So we'll rewrite 


ie K(x')dx’ = a K(x')dx! — a K(x’) dx’, 


and the first term on the right-hand side is just some constant 6, so we can rewrite (still working in thea<x <b 


Tas exp (- . Kd) , 


And indeed, this is what we expected at the beginning — there is mostly a decaying piece under the barrier as x increases 


region under the barrier) 


V(x) & — 


from ato b. So this allows us to apply the connection formulas now: for an upward-sloping potential, a decaying 
exponential becomes twice a cosine, and the arrow points in the right direction (and a negative sine becomes a growing 


exponential, but we don't need that). So that tells us that in the classically allowed region x < a, we have 


W(x) = aS cos ([ k(x')dx' — z) 


which we can write in terms of an oscillating exponential as 


= aa (cx ([ k(x')dx' — 7) + exp (-if k(x')dx' + *)) . 


Since we care about the coefficients of the right-moving and left-moving wave, we can take x —> —oo here, and because 


k(x) again goes to the same k = veme we have again 


a 
} k(x')dx > -—kx +C’ 


x 


for some other constant C’. And thus the form of our solution for x + —oo is 


@ 
w(x) ~~ (ee ae ante) 
vk 


and this gives us the desired form with the incident and reflected waves. We've now found an approximate solution, 
and if we read off the transmission coefficients, we find that the transmitted wave has amplitude wa and the incident 


7 6 
wave has amplitude ae Xe) 
2 


F/Vk 
Fe®/Vk 


—20 


TwkB = 


or writing out the expression more explicitly, 


b 
Twkp = exp (-2 ax’ n(x!) . 
a 


But notice that to the accuracy we’re working with, the incident and reflected waves have the same magnitude, so 


Rwxp = 1). And because we want R?2 + T? = 1, this approximation is only valid when T < 1, and this happens 


indeed when we have a tall and wide barrier. 


Fact 73 


There are corrections (in principle) that come in the prefactor of the exponential, so the accuracy we have here 


is only in the exponent —2 ie dx'K(x’). 
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And notice that if we had blindly applied WKB without dropping the first exponential term, we would find that 


e728 


a 
(1+ Ze-79) 


So as we said above, this isn’t a huge change to the answer, but we would just be claiming higher accuracy than we 
actually have (since we really don't even know the prefactor at all). 


Remark 74. /f we had a different k on the left and right, we would need to be more careful and define the transmission 
probability in terms of the flux or probability current instead of just the relative amplitudes, which depends on w*Vw 
(so it'll bring down a factor of k). 


Although we've computed the transmission or tunneling probability here, we're often more interested in the 


tunneling rate: 


Example 75 


Suppose we have a potential with a local (but not global) minimum, and we start with a particle localized inside 


that minimum. This is not a stationary or bound state, and the particle will eventually leak out — we are interested 


in how long it takes for this to happen. 


We'll solve this with a combination of classical and quantum thinking, starting by imagining the particle inside the 
well as rattling back and forth with some energy E. Then every time it “hits the right wall,” it has a probability of 
escaping through the barrier, and we'll use the WKB approximation for that number. 

If we need to tunnel through a potential with turning points x = a, b, we know that the tunneling probability is 
T = exp (-2 Hh dx'e(x")), and now we need to estimate how often it hits that wall. In the classical treatment, 


the time it takes to travel between turning points x = c,a in the local minimum (that is, the time between escape 


at=2 [2 =2m f * 


(because the particle needs to go from a to c, and then back to a). If we then let P(t) be the probability that at time 


attempts) is 


t, the particle is inside our local minimum well, we know that P(0) = 1, and we know that 
PUEPAt) = PST P(t) 


(there's an escape probability of T, as long as the particle is still in the well at time t). And if At is small enough, we 
can rewrite this equation as 


dP dP 1 
At—_ =-TP(t) = —-=-=P(t 
dt (t) dt T (t), 


where T = a. We then find that we get the exponential decay 


P(t)=et/" 


with the lifetime 7 = ae and decay rate 2, 


Remark 76. Remember that we only know T to exponential accuracy — the calculation doesn't determine the prefactor. 
So the factor of 2 in the classical estimate for At really doesn't hold up, and we just have an order-of-magnitude 
estimate. But (as we discussed in recitation) applying this to Gamow’s theory of a-decay of nuclei holds up well to 


experimental observations! 
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Fact 77 


If we want to think about the actual dynamics of tunneling, we have to solve the time-dependent Schrodinger 


equation, send in a wave packet, and understand how that evolves over time. That will give us, at large time, two 


wave packets moving to the left and right. 


We'll now move on to our next topic — motion of charged particles in electromagnetic fields. The idea is that 
we'll be able to address how quantum particles move in electromagnetic fields, but we'll encounter the gauge principle 


along the way. 


Example 78 


Suppose that we have a particle of mass m and charge gq moving in some electric field E(x, t) and magnetic field 


B(x, t). 


First, let’s treat the particle classically (we'll be treating the fields classically throughout). Then we know that the 
Lorentz force equation looks like 
d?x 


. si Wess 
share = Florentz = gE (x, t) + C. x B(x, t), 


and we want to now incorporate a quantum treatment of our particle and write down a Schrodinger equation. And the 
interesting thing is that quantum mechanically, we need to work with gauge potentials instead of the fields directly. 


Recall that the gauge potentials ® and A are defined as follows: because V - B = 0, we can write (at least locally) 


B=V x Al for some vector potential A, and because V x FE = 198 we have 


for the scalar potential ©, and thus Ea-yo = |, 


Proposition 79 


There are certain freedoms in how we pick © and A, and these lead to the gauge transformations. 


Notice that if we change 
Ai =A+VA 


(which doesn’t change B, because curl of gradient is zero), and we change 


(which doesn’t change E because the two contributions cancel out), we find that 


E(A’,¢') =E(A,%), BA’, 0’) = B(A, 0) 
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(the transformed potentials and original transforms give us the same electric and magnetic field). So (®, A) should 
really be considered equivalent to (%’, A’), and these gauge transformations endow the space of gauge potentials with 


an equivalence relation — an electromagnetic field is just an equivalence class under this relation. 


Fact 80 
It turns out that there are (¢, A) and (%’, A’) that give the same electromagnetic fields, but aren’t related by a 


gauge transformation. 


So even though they look the same classically, they are different quantum mechanically (because they are in different 
equivalence classes!), and thus they will be treated as different electromagnetic configurations. In fact, we will see 
that there are actual physical differences in how the quantum particle is affected, and we can see this through the 
Aharonov-Bohm effect. These differences do not arise in infinite Minkowski space R® x R;, but they do arise if we 


have a nontrivial topology on the space (for example, particles confined to move on a ring). 


Fact 81 


It also turns out that there are electric and magnetic fields E, B that satisfy Maxwell's equations but cannot be 


written in terms of the gauge potentials A and . 


Since the gauge potentials are the fundamental objects, those electromagnetic fields are actually illegal — those E, B 
configurations won't be allowed in the quantum particle case. (And again, this only occurs on spaces with nontrivial 
topology — we'll see this in the case of a magnetic field on a torus.) 

We're now ready to introduce the modified Schrodinger equation in the case of an electromagnetic potential: in 


free space, we have 

Ow 
ot 

Pe 


where originally we have H = 55. To introduce , we know that a particle has potential energy q® in a scalar 


ih = Hy, 


potential, so we can replace H — H + q® easily (for example, that’s what we did for the -£ term in the hydrogen 


Hamiltonian). But A is more complicated: it turns out that we need to replace the momentum operator as 
= = Oa = 
po p- CAR, t). 


This is called the minimal coupling, and it gives us the full Hamiltonian 


This is already the form of the Hamiltonian in classical mechanics, but we'll also understand later on why this form 
makes a lot of sense — we'll in fact be able to derive the quantum analog of the Lorentz force law starting from this 
Hamiltonian. 

Now that we’re in the quantum world, we should replace X with the x operator, and thus A is also a quantum 
operator just like p. And in the position-space representation, A acts by multiplication, but it can act more nontrivially 
in other representations. But the momentum operator p still satisfies [6;, Xj] = —/fd;;, and we still have f= —ifV in 
position space. So in other words, we can write the representation of H in position space as 


1 


H= 
2m 


ee : - 
( in — TAs, t)) + qo(X, t). 


64 


But it turns out the physical velocity now changes to 


so that the kinetic energy of the particle is still $mv?, and we'll understand this a little more later on as well. 


We can try to expand this square out and write (using repeated index notation) 


1 g q 1 q q ge 
= (6, = 0) (pi Ai) t gD = <—(pjpj — —Ajpi — —PiAi + ZAiAi) + 9®, 
2m C Cc 2m Cc Cc Cc 


which can be rewritten as 


=O 2 
= Pp g (A. 54 p - A) \ q n2 
~ 2m 2me ee ee i ae ae 


(Remember that A and jp do not commute, because A depends on X.) We could then write 


= 


p- A= piAj = Aipi + [pi Ai(%, t)] = Aipi — indjAj = A- if A, 
so substituting back in gives us 
=P 2 
p qd zx iqia 7, Y pp 
A: p+ -A4 A 
2m mc me 2mc? 


t+ q®. 


So now we should have a feeling for how these kinds of calculations go, and the next issue we want to discuss is that 
of gauge-invariance — how does it manifest in the Schrodinger equation? Going back to the Schrodinger equation 


OW 
Do. 


(sa ( inv TAR, 1) + a(x, :) w, 


if we replace the potentials with some gauge-transformed ones, we find that 


in =( : ( inv TAR, t)) +40", ») v, 


2m 


and we want to see whether a solution W to the original Schrodinger equation also solves the gauge-transformed 


Schrodinger equation. The answer turns out to be no, but there is a simple relation between the w and w’, namely 


a! (X, t) = exp = ) W(X, t). 


So we add a position- and time-dependent phase to w, but the amplitude does not change. We'll discuss the implications 


and derivation of this next time! 


19 March 24, 2021 (Recitation) 


Last lecture, we transitioned from WKB and tunneling to talking about quantum mechanics in the presence of elec- 
tromagnetic fields (which comes up in the form of a vector and scalar potential in the Hamiltonian). We'll try to cover 


a little bit of each topic today. 
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Example 82 


Consider the tunneling problem for a finite square barrier, meaning that 


VY O<x<a 


V(x) = 


0 otherwise. 


If we send in a plane wave of energy E (where 0 < E < WW), then we can treat this problem with the WKB 


approximation, and the tunneling probability 7 = e~7° (as we derived in lecture) is only accurate if 
1 a 
G= 5 f dx./2m(WY — E) 
Ah Jo 


is large. But we can evaluate this easily because the integral is a constant, so 0 = $/2M(Vo — EF), and thus this 
approximation is indeed accurate if a is large (wide barrier) and Vo — E is large (high barrier). 
And we can actually solve this problem exactly without the need for WKB here! The idea is to solve the Schrodinger 


equation in each region whee the potential is constant: the solution is exponential or oscillatory in each region, so that 


(letting k = ¢V2mE and k = #\/2m( — E)) 


Ae’ + Bek x <0 
W(x) = 4 Ce + De™ O<x<a 
Pel x2 a 
(we're only using an outgoing wave in the right region x > a, because we're treating this as a scattering problem like 


in lecture). And now we need to equate the coefficients A, B, C, D, F by using continuity of w and its derivative w’ at 


xX = 0 and x = a: those four conditions give us relative ratios of the five coefficients, and it turns out in particular that 


F|? 1 
1 + ZE(W—E) sinh (0) 
In the limit where @ is large, we have sinhé@ = .. so plugging in e allows us to drop the 1 in the denominator and 
find T = 16E (GE) e~°8 +... So the WKB solution does indeed emerge in the appropriate limit (with a more specific 


O(1) prefactor). 


Remark 83. Any potential barriers that are piecewise constant are easy to do with the WKB approximation as well, 


because the integrand will also be piecewise constant. So we can try that ourselves if we want! 


Recall that we used the example of positronium a few recitations ago, and we mentioned that there was a lifetime 
for positronium because of potential decay to photons. And we can in fact connect this phenomenon to the probability 
to remain trapped inside a well, which takes the form P(t) = e~*/7 for some lifetime T = ae (where At is the classical 
time spent between tunneling attempts, and T is the WKB transmission coefficient). Positronium was different because 
it doesn't have a physical potential in space, but the formula P(t) applies again because we have one quantum state 
(positronium) which decays to a very different quantum state (photons). And it turns out that if we think about the 


probability by integrating the wavefunction for positronium over all space, we have 


P(t) = ‘ dx bere-(x, t)/? 


instead of the usual 1. Basically, what’s happening is that our photons yy and yy7¥ are left out of our Hilbert space, 
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so we're losing some probability because of decay transitioning us out of the space. The idea is that probability 


conservation is connected with the unitary time evolution operator U(t, to), because the Hamiltonian H is Hermitian. 


Example 84 


Let’s think about some non-Hermitian effects: we replace our potential V(x) with the potential V(x) — a (which 


is no longer Hermitian). 


Then our Schrodinger equation becomes 


Oy Ro Lr 
Dae ag Pe ay) 


and we want to understand how this affects the probabilities. Normally we have 2p+V -FJ=0 (current conservation), 


but now that we added this iz term, we have 


OP Digi 900 Oy* 
ap pe OY oe Ya 


and plugging in the Schrodinger equation and its conjugate via 


Op ih _, i 7 Oy* oh oe rae keen 
Stam” © AY onY ae omy YT RY — on” 
yields an equation of 
Op if 7 oo Qa) r 
Bt 2m (p Vib — eve : A’ 


(the V terms cancel out because they have an /, but the [ terms don't because they are real). And now this first term 


is usually —V - J, where we have the probability current 
— fi ‘ Aly i: 
J=— in Vb) = SPV Hav), 
m m2i 


and we can confirm that taking another derivative of this gives us back the w*V2y — wV2W* above. And yet another 


way to write this expression is 
1 e 
= —Re(w"py), 
m 
because Im(z) = Re(—/z). But returning to the extra term in the boxed equation above, we know that 
d fe) 
—P(t) = — | dxp(x,t), 
SPC) = = | dxplx.t) 
and then we can push the partial derivative inside the integral and substitute in what we know to get 


= Op _ ae gf. E 
= f axZe = foxes 5 | oe. 


And because we're assuming there’s no flow of current at oo, the first integral is just zero (here we’re using the 


divergence theorem), and thus the equation simplifies to 
d [ 
—P(t) =0-— =P(t). 
<P(t) =0- = P(t) 


So originally we had a = 0, and now we have the exponential decay differential equation, as expected! So indeed we 


get the probability P(t) = Poe */7, where T = a. 
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Example 85 


In the remaining time, we'll start talking about the Schrodinger equation and adding in the potentials O(x, t) and 


A(X, t). 


Recall that we can recover the electric and magnetic fields via F=-Vo-— +94 and B = V x A, and we found 
that the updated Hamiltonian for a charged particle of charge q is 


1 


Aa 
2m 


ee. be 
(2 a A(x, i) + qh(d, t). 
If we treat x, p as the classical position and momentum, we get back the Lorentz force equations, so this is a valid 
classical Hamiltonian (like we may have seen in 8.223). And now if we canonically quantize this by turning x and p 
into operators, that's a valid description of how to get to this quantum Hamiltonian. We expanded out the Hamiltonian 


in lecture and found that 


a p qd > q? rv) 
ar mee ale A} ae rae, 


remembering that the order of the A and P operators matter. We can then write this Hamiltonian in a position basis 


to find he i " 
Se ae 4 a(V- A) + =" + 90, 
2m mc 


where we've replaced p by —ifV. So we have our usual V potential term, and we have a modified kinetic energy term 


where it’s useful to define a modified velocity via 


(it'll turn out that in this setup, the velocity is more fundamental than the momentum). And we'll close by talking 


a little about probability current: the probability density is still o = |q(x, t)|?, and we still have the same equation 


Zot V-J=0, but this time the form of Jis a bit different: doing the same product rule calculation or) as above 
gives us 
Oy ih _, G23 q a i { qf 
= A A)y -— = 
ot Tad ¥ mc Wy oma’ yy h \ 2mc? ae Pe 
and the similar equation 
Oy* oh 


vy + LA ours LG Awe i (P* 4 ae) y 
dt 2m mc 2mc A \amc2 49 
Adding together ye + ye “again gives us a few terms that don’t cancel: we end up finding that (after some 


simplification) 
Op 
at 


So our new probability current is 


ifi 5 Vs xXypeyy Ss 
= 5 (wvey — wv?y") + LV (ayy) = 


J= Re (v*7yp) |, 


where V = £ — ae (possibly as we might expect from our definition of the modified velocity above)! 
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20 March 25, 2021 


Last time, we started discussing motion in an electromagnetic field: we mentioned that we need to specify the vector 


potential A and the scalar potential ©, and they enter the Hamiltonian in the manner 


p= S5 


(Here, A and © are both functions of space and time.) We mentioned the principle of gauge-invariance last time: 
configurations of EandB correspond to choices of the potential, modulo certain gauge transformations. Specifically, 


we have the following: 


Proposition 86 


If w solves the Schrodinger equation, and then we do a gauge transformation 


ie ie ee 


c Ot’ 


then w’ = exp (#4) w (where this phase term does depend on space and time because A is a function of X and t) 


will solve the new Schrodinger equation. 


We will denote U(X, t) = exp (#4) for convenience. Before we prove this result, we can do a quick intermediate 


calculation: 


Lemma 87 


We have the relation (for any wavefunction w) 


(-inv = <2) Uw =U (-inv = tA) w. 


Proof. The left-hand side can be written by the product rule as 
pes : = q,47 D1 5 
—ih(VU)p — ihU(Vw) — (Uae = CUVAe 


(remember that A and U are both just functions of X and t, so they can commute past each other). Since U is an 


exponential term, we can simplify this as 


TO AUW 4 u( inv tA) wp TUVAW. 
fic G C 


ih 


And in fact, the first and last terms here are actually the same, and we get the expression on the right-hand side, as 


desired. 


Proof of Proposition 86. Suppose that wy’ satisfies the Schrodinger equation 


Oy’ 1 


" _ 
' ot 2m 


( inv ta) y L godly’. 


Our goal is to show that w satisfies the Schrodinger equation with A and ©. Because 7’ = Uy, we can simplify our 


equation a bit to 


i) _ de es Re LOA 
in (UY) = ( inv <2) uw +a(o- 25) ue 
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Simplifying this by the product rule, we find that 


ign Owl a Oe. LOA 
ihe ae UY + USE = 5 aU (in da) v+au( a) 


where we've used Lemma 87 twice on the right-hand side to move U past both copies of (—indj a 2A\), turning both 


A's into As, and we've also used that U commutes past ( — ish) on the right-hand side. But now the first term on 
the left-hand side cancels with the last term on the right-hand side, and we end up with 
0 1 ee =\ 2 
Ho u( inv TA) p+ qUoy, 
Ot 2m c 


which is the desired equation by multiplying by U~! on the left, since U is a unitary operator. 


So now that we understand how our wavefunction changes under a gauge transformation w’ = Uw, we can ask 
how the physics depends on the choice of gauge. First of all, the probability density p(X, t) = |W(X, t)|? is definitely 
gauge-invariant, because the phase does not contribute to the magnitude. But we may also want to check other 


physical observables, like the position, and that motivates the following definition: 


Definition 88 


An observable O[A, 4] is a gauge-covariant observable if O is Hermitian (meaning Ot = O), and 


O' = O[A’, o'] = UVO[A, o]U-!. 


(Here, we're thinking of U and O as operators, and the right-hand side expression is a conjugation operation.) 
What's important is that all physical observables in quantum mechanics are gauge-covariant — this is a postulate 
that we're working with. And what this gauge covariance guarantees is that the outcome of the measurement will not 
depend on the gauge, because the eigenvalues (possible values of measurements) of conjugate operators O’ = VOU! 


and O are the same. To see this, notice that if |x) is an eigenvector of O, meaning that 
O|x) = Ax), 
then U |x) will be an eigenvector of O’: 
O'U |x) = VOU" |x) = UO |x) = UA|x) = AU |x). 
So the set of predictions will be the same, and the expectation value transforms as 


(VIO|b) > (W'O'lY’) 


(under a gauge transformation, both the operator and the wavefunction transform). And indeed, 


(p'|O'|w') = (Up|UOU™*|Up) = (w|UTUOU*U|p) = (p|O|v) 


because U is a unitary operator. 
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Example 89 


The operator X is gauge-covariant. To show this, notice that X doesn't depend on A and ® explicitly, so X’ is just 


X, and then 


UxXU7! = XUU7+ = x= %, 


since U is a function of X (and thus commutes with x). 


Example 90 


The quantity O = p— 4K (which enters in the Hamiltonian) is also gauge-covariant, because O! = p — 
(replacing A with A’), and then 


ay (5- 4A) Ut =Vvou 


is true, because after multiplying by U on the right-hand side, this identity is just Lemma 87. 


We mentioned that we expect that the combination mV = p— 2A should be the physical observable, so this fact 


should be reassuring to us. 


Example 91 
The canonical momentum operator f is not gauge covariant, because A does transform under gauge transforma- 


tions, but the combination p — 4A iS gauge covariant. 


Proposition 92 


Sums of gauge covariant operators are also gauge covariant, and products of gauge covariant operators are gauge 


covariant if we don’t require the Hermiticity assumption. 


(We should try checking these facts from the definition on our own.) 


Remark 93. We're being very pedantic here about “gauge-covariant observables” and the outcome of the measurements 


being “gauge-invariant.” Outside of this class, though, we may hear the observables as just being called gauge-invariant. 


And now that we have more justification for the velocity V, we'll talk more about the Lorentz force law. Recall 
that classically, we just define V = Xand F= mx, and we also know that F= gE (x, t)+ q% x B(x, t). If we want a 
quantum analog for this, the analog to classical equations of motion are the Heisenberg equations of motion from 


8.05: instead of thinking of the wavefunction time-evolving as 


p(t)) = U(t) |Y(0)) , 


where U(t) is not the same gauge transform U that we've been talking about so far, and then we calculate expectations 


of observables via 
(A)(t) = ((t)|Al P(t) - 
But we can rewrite this expression as 


= ((0)|Ut(t)AU(t)|¥(0)) , 
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and thus the combination Ut(t)AU(t) is defined as the Heisenberg operator A;;(t) (so that the operator evolves, 


rather than the wavefunction). We then get the equation of motion 


dAy | 


ae [Hy(t), An(t)] + Gar 


And notably, if our Schrodinger Hamiltonian H = Hs is a function of p, xX, and t, then the Heisenberg Hamiltonian is 
Hy = H(px, Xn, t), 
and we have the commutators (for any fixed time ft) 


[>ie(t), ofy(€)] = ind. 


2 


If we now apply this formalism to our Hamiltonian for a charged particle in an electromagnetic field H = se (5- 2A) + 


q®, we can think about how the individual components here transform: we have 


oH = : Pon Ge 7 E (BH — 2A, i) + gO(Xy, tm 


(where we've replaced the positions and momenta in H with their Heisenberg equivalents). The second term q®(Xp, t) 


commutes with x}, and we'll rewrite the other term in Einstein notation (summing over /): 
i {1 - Ga BG es . 
5 Ee (Ph (A nn t)) (oh CA G8) .] | 
We can now use the commutation relation [BC, D] = B[C, D] + [B, D]C (we can check this ourselves) to find that 
this expression is equal to 
i oil , Cae ; dois. : : qd .:.5 : 4 qd... 
~ dh (He $4140) [He 2450) i] + [Hh SAG 9) (he $09) 

But now the commutators just evaluate to —/fid;;, because the A! term commutes with et So our final expression is 


ee 
~ 2m 


(25 (0, 2410) 4 (y 24608). 


So we now have an equation of motion if we move the m over to the other side: 


so Ape 
mx = Bu — A(X, t) | 


And indeed, this tells us that we make the definition we've been alluding to: 


Definition 94 


The (gauge-covariant) velocity operator V is given by 


And to get the Lorentz force law, we can apply the same technology, computing mv = mx through the Heisenberg 


equation of motion for the operator A = p— LA. It's a lengthy calculation, but the final result is that 


min = aE + $ (x By By x) 


And indeed, if this were classical, the antisymmetry of the cross product would give us back the ordinary Lorentz force 


law. (And E and B become Heisenberg operators by replacing X with xy.) 


Remark 95. The main point is that we can only ever measure the kinematic momentum, not the canonical momentum 
p, in experiments. And if we add additional forces to the picture, we'll need to add more terms to the momentum to 
get the thing that we actually measure in experiments! But then we usually get into quantum field theory because the 


expressions get much more complicated. 


Example 96 


We'll now turn our attention to a specific problem, the magnetic field on a two-dimensional torus. 


The results will imply that certain configurations of magnetic field that are okay in the classical world are not 
allowed in the quantum world. And this will show us the quantization of the magnetic flux, which will show that 
quantization of electric charge must be possible if we had magnetic monopoles! 

As a geometer would say, a torus is the surface of a donut. But more algebraically, we can think of a circle (the 
one-dimensional analog of a torus) as a line segment with the endpoints identified, and then a torus as a rectangle 
with opposite sides identified. (We can imagine taking a sheet of paper, rolling it up along one direction to get a 
cylinder, and then rolling it up along the other direction to get a donut.) And this is equivalent to thinking about the 


entire two-dimensional plane and making an identification according to the equivalence relation 
(xy) ~(xthaiy)~(xytLly) — forall x,y. 


(So for example, the point (0,0) would be identified with (Lx,0), but also (0, Ly), and (3L,,2L,), and so on.) We 


want to study an electromagnetic field configuration, in which we have a constant magnetic field 
B= Boz 


perpendicular to the surface of the torus, and there is no electric field (so EF = 0). We would then need our vector 
potential to satisfy 
Bo = OxAy — OVA, 


and one such choice that would work would be Ay(x, y) = Box and A,(x,y) = 0. But remember that we're on a 
torus, so we have a periodicity condition, and here the y-component of the vector potential is not quite periodic 
(since shifting x by Ly changes A,)! 

It turns out that we don’t need to assume the vector potential is periodic, because it's not a measurable quantity 
— instead, we only need periodicity up to gauge transformations. We'll implement this next time, and we'll see that 


this will give us the desired quantization. 


21 March 29, 2021 (Recitation) 


We've started understanding how charged particles behave in electromagnetic fields, and we'll see a lot of interesting 


examples of physical systems in lecture and our problem set. Today, we'll go over one particular example in detail: 


Example 97 


Consider a charged particle in a uniform magnetic field B = B2. Classically, if we translate our system in the 


Xy-plane, nothing will change. But we'll see something interesting happen in the quantum mechanical case. 
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In this case, we can choose to have zero scalar potential & (because we have a magnetic field, but not an electric 


a Geral 


field), and we can set 


for any vector potential where V x A= B2. We get some freedom of choice here, and we'll pick 


=~ B 
A= 5 (-¥%,0), 


so that (V x A), = oy — Oe = B as desired. (This is sometimes called the symmetric gauge.) 

Classically, we know that this particle will do circular orbits in the xy-plane (or free motion in the z-direction, but 
for simplicity we can assume that doesn’t happen). So the point is that we'll work with a 2-dimensional Hamiltonian, 
so that we have the operators 61, P2 = Bx, By and %, X%» = X,Y. If we then plug in A into our Hamiltonian, we find 


that (ignoring the z-component and writing out the squared vector component-by-component) 
H= 5 (Br + mua)? + 5 (Ba — mw)? 
= — mw. — — mwx,)°, 
Am P1 2 ohh P2 1 


where w = #8. is a characteristic frequency. And now if we want to do a translation (in quantum mechanics), we’re 
going to notice that we don't have symmetry even though B is uniform, because we have the translation in the 


X,-direction (for example) 
; 
T(b) = exp (— 5601) , 
which is a unitary operator which translates in the x,-direction by b. As a reminder, we can check that the operator 


acts on wavefunctions as 
Tt (b)&T(b) = % +b => T(b)v(m) = Oa|T |b) = Wa — Bb). 


But in this given problem, the momentum operators 6; do not commute with the Hamiltonian (because we see the & 
operators appearing explicitly), so we won't get a translation to an equivalent state. So we'll change variables a bit 
to make this Hamiltonian more tractable to work with: we'll change coordinates {x1, x2, P1, P2} > {7,@, P,, Po} such 
that 


1 
T = py + MW, e=5(4-=), Py = py —mwx;, Po = po t+ mw. 


Because of the convenient normalization that we chose, we then get a Hamiltonian that looks like 


tT =m 
fe a" Pe 
om? gue? 


where We = 2w = BT and now this looks a lot like a harmonic oscillator Hamiltonian! (The frequency we is called the 


cyclotron frequency.) But to make sure it actually is a harmonic oscillator, the two operators m and @ that appear 


should behave like x and p, in the sense that their commutator looks correct. And we can verify that directly: 


il 1 P2 ee a 
ld, rt] = 5, Pil 5 =, mu| a 5 iit 5 ih = if. 


So the problem can now be solved by using the usual raising and lowering operators, and we know the energy eigenvalues 


1 
E,= Ne (n+ 5) 


Therefore, a particle in a uniform magnetic field has quantized energies, and the ns here are called the Landau levels, 


of this system: 


which we'll discuss in lecture. But for now, we also want to think about why we picked the particular P, and Po: we 
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can compute the commutators 


p22] 1 


[P2.6)=0, (Pi.41= Slo. mil +5 [-mur, in + 3ih =0, 
and we can also check that [Po, 7] = [P1, 7] = 0 with a similar calculation. And now because H is just quadratic in 
the two variables a and ¢, the P; operators are momentum-type operators that commute with the Hamiltonian — 
“momentum-type” coming from the fact that our new operators also have the familiar commutators [x;, Pi] = [x;, pj] = 
ifid;;. 

So the point is that these operators will be able to generate some kind of translations like p; and p> originally did, 


and we'll analogously define 


i A ibmw ,. 
T,(b) = exp (— 701) = exp Guz + or i) . 


If we think about applying T(b) on a two-dimensional wavefunction w(x, x2), we then get 


ib 
Ti (b) (x1, x2) = exp (A) (x — b,x), 


(the exponential coming from the % part and the shift coming from the usual 6; part). So it may seem promising to 
look at the combination of operators {H, P,, P2}, and if we want to think about this as a complete set of commuting 


observables, we also need to consider the commutator [P;, P2], and in fact those do not commute: 
[P,, Po] = [pr, mwx,] + [—Mwx2, po] = —2imfw. 


Thus, what we should think about instead is the implication of this commutator on the translation operators 71(b1) = 


exp (—#b1P;) and the analogous T2(b2) = exp (—#b2P2). And now 
[T1(b1), T2(b2)] = Ti(b1) T2(b2) — T2(b2) Tr (b1) 


is going to encode the issue with how the two different translation paths differ (classically, doing the two translations 


shouldn't give us any difference). The trick for computing this expression is to first consider the function 


I | 
f (by) = T}(b1)P2Ti(bi) = exp (5a) Py exp (— HF) 


(which would be the “first term in the Taylor expansion” which we expand out the exponentials). And we can compute 


this with a trick: notice that by the product rule, 


df i i i ee 
an p11 (1) Pa PoTa (br) = 11 (1) PoP Ta (br) = PARC it P>]T,(b1) = 5 (—2imush) T] (b1) Ta (br) = 2mu, 


which means that we have (by integration) f(b1) = 2mwb, + f(0) = 2mwb; + Po, by just plugging in b; = 0 into the 


definition. So this function f shifts the operator P>, and now we can use this to solve our question by expanding out 


< ( ibo\" 1 < ( ibo\" 1 n 
T}(by)Ta(bs)Ta(bs) = TES. ( =) Hears —(T{Pn) . 
n= 


n=0 


and substituting in the values that we just got yields 


=> ( =) < (Py + 2muwbi)" = Tobe) exp (—2ibr bo) . 
n=0 : 
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In other words, we have the relation 
To(bo)T1 (bi) = Tr (bi) To (be)? forme 


things almost commute up to an extra phase term! And now if we pick a particular value of bj bz so that by bo "s = 2mn 


for some integer n (remembering we = 2w), then the phase term just becomes 1. So for example, if we have n’ = 1, 
then the relation is b,by = ant = ankle (where £g is the magnetic length, the length scale of the “harmonic 
oscillator” we've constructed) The magnetic flux Bb, b. then turns out to be quantized in terms of fundamental 
constants via Dp = and the point is that there are an infinite number of states (an infinite degeneracy) because of 


the different allowed values of n’. 


22 March 30, 2021 


We're continuing our discussion of motion of charged particles in electromagnetic fields today, looking more at a uniform 
magnetic field on a 2-dimensional torus. (And this problem is interesting because there are certain configurations of 
electric and magnetic fields that aren’t allowed in quantum mechanics.) 

Recall that the setup is as follows: we think of the torus as a rectangle with side lengths L, and Ly with opposite 
sides identified, or equivalently as tiling the entire 2D plane and identifying two points which are L, apart in the 
x-direction or Ly apart in the y-direction. (So we have the whole plane, modulo translations.) We then placed 
a constant uniform magnetic field B = Bo2 on this plane, and we then started thinking about how to define the 
potential A through the equation B= Vx A. 

There are many choices for what A we can use, but one such solution we mentioned last time was to take A, = 0 
and Ay = Box. We now need to think about whether this is a suitable configuration on a torus — indeed, because Ay 


only depends on x, we do have periodicity in one direction: 
Ay(x,y+Ly) = Ay(x, y). 
However, along the translations in the other direction, we have 
Ay(x + Ly. y) = Ayu y) + Bol x. 


So we need to think about whether A needs to be periodic, especially because it is not a physical observable — it’s not 


“gauge invariant.” What we'll do is take the vector equation 
A(x + Lx y) = A(x, y) + (0, Bolx) 


and try to write it as a gauge transformation, thinking of the second term as VA(x, y) for some A. And we can pick 


something like A(x, y) = Bol,y, and now that means our vector potential is periodic up to a gauge transformation: 
A(x + Lay) = AG y) + VAC y) 
So the wavefunction of a charged particle living on this 2D plane is also periodic up to that gauge transformation: 


Wxtley)=Ux YW y), 


iq\(xy) iqBoLxy 


where U(x,y) =e % = e %~. And under this particular choice of A, we have A(x,y + Ly) = A(x, y) and 


W(ixy+Ly) = (x,y), and therefore the boundary conditions seem to be properly satisfied. 
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Remark 98. Remember that the wavefunction itself is not measurable, so adding a phase when we translate is okay. 


(Quantities that are actually gauge covariant, like |w?|, indeed remain constant.) 


But we also need to figure out whether these boundary conditions are consistent: if we want to get from (x, y) 
to (x + Ly,y + Ly), for example, we can translate first horizontally and then vertically, or first vertically and then 


horizontally. Let’s check that those give us the same answer: 


Wxtloeytly)=Wxtley)=Uxyvxy), 


and (translating in the other order) 


Wxtlyyt Ly) =U(x,y+ Ly)w(x,y + Ly) =U(x,y+ Ly)w(x, y). 


So we must have U(x, y) = U(x, y+ Ly), meaning that 


iqBoLl iqBoLl p iqBoLlyLl 
5 AER) cay g | OE): 2a WOE) ag 
fic fic fic 


is the requirement for our boundary conditions to be consistent! And since Bol,Ly is the flux & of the magnetic field 


through the surface of the torus, we must have 


qe 2uh 
oo (Z2) =1 d= ot 


for some integer n, meaning that we have quantization of the magnetic flux! We will define 5 = aus to be the 
flux quantum, and this is the fundamental scale for our magnetic flux. For example, if we take the charge to be an 
electron, we have Go — 2nhe e 4.134 x 10~” Gauss - cm?, which is big enough to be measured. 


Remark 99. /n the literature, we might find Dy = one with an extra factor of 2, because that constant plays a role 


in superconducting physics. 


Fact 100 


We've been doing this problem very formally, treating the torus as a fundamental domain of the 2D plane. But in 


nature, we can think about a torus embedded in 3D space (like a bagel), and we want to ask if we can create a 


magnetic flux through the surface of that bagel. 


We then know that in standard Maxwell electrodynamics, 
o= [B-as= [(¥-B)av =o 


by the divergence theorem. So the flux through the surface will always be zero, which is not very interesting. But if 
we make an assumption that magnetic monopoles can actually exist, and one of strength ® sits inside of our bagel 


(meaning that V-B = 063(X—X)), we'll have a flux of © through the surface. Then a particle moving on the surface 


gq? 
2Qnhic 


of the bagel of charge q will actually still satisfy the same quantization condition € Z, even though our surface is 


no longer flat and the magnetic field is not necessarily uniform. Thus, we can turn the quantization statement around: 


Proposition 101 


If a single magnetic monopole exists in nature of strength @ exists in our universe, then charge must be quantized 


in units of gq = wee 


a 


Remark 102. We chose a particular A, = 0 and Ay = Box early on in this problem, and if we had chosen a different 


vector potential A, we would still get the same quantization condition. 


And if we have a more general two-dimensional manifold and we want to try to find A, we try to define it on small 
patches of the manifold so that the vector potentials (and wavefunctions) are related by gauge transformations Ajj 
when two patches /, / overlap. 

If we then have a “triple intersection” of three such patches, we have a consistency condition required — if we go 
from patch 1 to 2 to 3 back to 1 using the three gauge transformations, we must get 1. This gives us the cocycle 


condition ec (A2tAsthis) — 1, and this in fact amounts to the equation 


This is somewhat similar to the Gauss-Bonnet theorem in mathematics! And the point is that the condition that 
we're obtaining here is very general. 


With this, we're ready to move on to the next (very related) topic, Landau levels. 


Example 103 


Consider an infinite xy-plane (this time without the periodic boundary conditions for a torus), where charges 


are bound to move, and there is again a magnetic field B = B2. We want to find the energy spectrum and 


wavefunctions for the problem. 


We'll start with a classical description. A particle of charge g moving in this xy-plane will experience a Lorentz 
force F = qt xB perpendicular both to the field and the direction of motion, and thus we get a centripetal force that 
makes the particle move in a circle. We can then figure out the frequency of the motion by equating forces: 

qvB- mv? 
Cc ig 


qB 
mc 


Vv 
We = 
r 


The particle then moves at this cyclotron frequency w-., regardless of the radius of the orbit. 


We'll seem some features of this as we do the quantum problem as well. First, we start with the Hamiltonian 


ae oe ee oe ee ae 
a Sn (Os TAs) + om (By 7Ay) tr q®, 


and we need to figure out the scalar and vector potentials ® and A. We can again choose ® = 0, and much like before 
we choose A so that B = 0,Ay — OyAy — this time, we will pick A= (—By,0,0), the Landau gauge. Plugging this 


into the above expression 


1 
7 — By 4 
2m (0. i: 
and we want to find the spectrum for this Hamiltonian. To do that, first notice that [H, 6,] = 0, so that we can 


simultaneously diagonalize H and py. Recalling that eigenfunctions for 6, are plane waves along the x-direction 


v(x, y) = Wye 


(for some arbitrary function w(y)) with eigenvalue fik,, we can substitute this form of the solution into H and effectively 


get the one-dimensional problem describing motion along the y-direction: 


Hr. W(y) = Evty), 
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where we now have a constant instead of the p, operator: 


1 qBy = i 33 Dy il qB ? fhik.c ‘ 
Hk = hk, 4 = ; 
Be Dit ( et ge ) am'¥~ om 2" \inc} qB 


We call the blue quantity = to be yo, and ab. is the cyclotron frequency. So for each k,, we have 
2 
p 1 
H,, = — + =mw2 p 
kk 9m 9 c(Y — Yo) 


which means we have a harmonic oscillator centered around yo, and thus the energy spectrum is E, = fw, (n + 5) 
for n a nonnegative integer. Since those energies do not depend on k,, that gives us the full spectrum of energies, 


and the wavefunctions will be related to the nth eigenfunction ¢, of the harmonic oscillator: 


[kx 


Wn(Y) = On(Y — Yo) => | Wn (XY) = Only — yo)e’™ |, 


where our states are labeled by both n and kx, but the energies only depend on n (these different energy levels are called 
Landau levels). The key thing to notice is that because k, can be any real number, we have an infinite degeneracy 


at each energy level. (But we'll talk more about this shortly.) 
|? is uniform along the x-coordinate (because the plane wave 


always have magnitude 1), and will be localized around yo with characteristic length scale 2g = 4/ wae = a in the y- 


Visualizing these wavefunctions, we notice that |Wpy,x, 


direction, because of the harmonic oscillator localization. This quantity 2g Is called the magnetic length, and what this 


means is that we should imagine most of the wavefunction is localized within a horizontal strip of y € [yo — £8, Yo t a]. 


Fact 104 

Remembering that the central y-coordinate yo is defined to be a constant times k,, this means that the different 
Wn, Wavefunctions for a given energy n have similar probability density, Just translated vertically from each other! 
And taking appropriate superpositions of different k,s actually make it possible to create wavefunctions that are 


localized in both directions. 


Example 105 


Thinking more about the infinite degeneracy, it makes sense for us to try to look instead at Just a finite domain 


of length L, along the x-direction and Ly along the y-direction. 


Then the degeneracy of each level actually becomes finite, as we will now see. For convenience, let’s require periodic 
boundary conditions in the x-direction (though the qualititative features won't depend on these exact conditions). So 


then we must have w(x + Lx, y) = w(x), and because 


Wyk (X y) = Waly a yo)e'*, 


we must have ky = orm for some integer m. This is still an infinite degeneracy, but it’s countable instead of uncountable 
£2.2nm 


now, and notice that this also means the vertical locations yo = — 02k =— are also discrete — as a consistency 
0220 
Ly 


We'll be a bit less precise about how things work in the y-direction now, but the idea is that if the vertical 


x 


check, we notice that the spacing between these locations Ayg = goes to 0 as Ly > co. 


localization yo is far outside of the allowed range [0, Ly], that won't be a valid wavefunction. (In real life, we'd have a 


significantly different potential outside [0, Ly], which would change the wavefunctions.) So we'll say that we can only 
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have 0 < yo < Ly, and thus we have an N-fold degeneracy given by 


by. ixky — tely. tye 8 


By 2ml 2g BARE By 


N 


and we see the magnetic flux appearing again! In fact, the degeneracy is (roughly) “how many flux quanta can fit 
inside our allowed rectangle,” and if we want to make this more precise, we can use the torus boundary conditions in 


both cases (in which case the formula becomes exact). 


Fact 106 


[we take rly ivem anc) i 4 Teslay Chenin turns Ouk = ou eee i) ca 


Do 4x10-7 Gauss-cm 
the degeneracy of each Landau level is very large, and when degeneracy is large, we have high sensitivity to 


perturbations because of potential energy splittings. 


Remark 107. Physical realizations of this setup include graphene (a single atomic layer of carbon) and semiconductor 
heterostructures (where we take two different semiconductor structures and grow them on top of each other to get 


an interface). 


The idea of the latter is that some electrons will be stuck in the interface but free to move within it, which gives 
us a two-dimensional electron gas, and this led to the quantum Hall effect — effects of certain perturbations, such as 
disorder and the Coulomb interaction between electrons, lead to the integer and the fractional quantum Hall effect, 
respectively. Basically, as we cool a system down, we observe that the Ohm's law relation for the Hall resistance 
Ry = Ma that we get when we have a current and magnetic field perpendicular to each other stops being completely 


linear in the magnetic field B, and it is instead quantized in terms of fundamental units of nature: we have 


h 
Ry=—=- 
1 @2y 
for some certain integer or rational numbers v. And we've found that this effect holds up to very high experimental 


precision! 


23 March 31, 2021 (Recitation) 


We saw last recitation and lecture the concept of Landau levels (in two different gauges): basically, if we have a 
particle in a uniform magnetic field, the energies get quantized like a harmonic oscillator with cyclotron frequency 
we = ae We'll try to get a closer understanding of this concept, particularly the degeneracy of the different levels, 
today. 

Recall that the degeneracy of a fundamental domain — a rectangle of length L, and width Ly — has degeneracy 
factor Sa where £, = J is the magnetic length. And if we move the factors of B to the numerator, we end up 


with the ratio na where © = AB is the magnetic flux through the rectangle and ®o is the fundamental flux quanta 


2Qmhc 
q 


large degeneracy. 


= - So a larger magnetic field will increase the degeneracy levels, and for typical magnetic fields this is a very 


AmwWe 
2ah 


Remark 108. Yet another way we can write this degeneracy factor is as , so that the dependence on charge is 


encoded in the cyclotron frequency. 


But we'll do an even simpler calculation now to get a sense of what this degeneracy calculation is really doing, by 


counting states for the free particle. 
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Example 109 


Consider a free particle in a square box with L, = Ly = L, and suppose that we impose periodic boundary 


conditions. 


Solutions for this two-dimensional system look like plane waves 
wy = Nel elhy, 


where periodicity require us to have ky, = 2 y and ky = 2m, where n, mare integers. So the spacing between different 


energy levels is given by on and if we think about the energy in this system, we Just have the kinetic free energy of a 


free particle, which is 
2 2 
p fi 
+ —~ = —_(k2 + ky). 


ca Pk 
2m 2m 


~ 2m 


So we don’t have the same kind of degeneracy, but we can still look at a range of energies and ask how many states 
are in that range. We'll think about this by counting how many states have energies in 0 < E < Ema,y, and that’s 
basically a condition constraining k,, ky to be within a circle centered at the origin. So that inequality can be rewritten 
as 

2m 


ke + ky < Fo Emax 


(where the right-hand side is sometimes written as k2, where kp = vemE max is the Fermi wavenumber). Then our 
counting estimate will be that the number of states with E < Emax is the ratio of the circle area and the fundamental 


energy spacing area: 


1k? AME max 
number of states © (On/ Le a 


(as long as the number of states is large so that the boundary corrections are not significant). And if we compare that 


to the result we had above, notice that this looks a little bit like the Arie that we had for the magnetic field! But 
the idea is that there are less Landau levels when we have a nonzero magnetic field than energy levels when we have 
a free particle: basically, the discrete levels that are originally spaced apart by on in wavenumber are being squished 
together into a single Landau level. 

And to understand that statement, remember that the Landau levels are at energies Shwe, 3fiwe, 3We, and so 
on. So what we claim is that the degeneracy in the lowest Landau level is equal to the number of free particle 
states in 0 < E < fiwe, and then the degeneracy of the Landau level hw, is the number of free particle states in 
fue < E < 2hwe, and so on. That would mean that free particle states basically get “squished to the closest energy 


level,” and indeed, the number of free particle states in the energy band 0 < E < fiwe Is 


Amfiwe Amu, 


Qnh2 — Onh ’ 


and similar calculations work for the higher Landau levels as well. 


Remark 110. Our approximations here only work for a significant magnetic field B, because the area approximations 
only work in the high-degeneracy limit. So we don’t need to worry about transitions in energy levels from B=0to 
B # 0 not behaving — our calculations are just not valid if ® is on the same order as Po, for example. Plus, we depends 


on B, so there really aren't any discontinuities at all — the spacing and degeneracy just increase along with B. 


We'll talk next about something related to the Aharonov-Bohm effect and interference experiments: 
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Example 111 


Consider a solenoid with radius a which causes a magnetic field B inside the solenoid, but zero magnetic field 


outside the solenoid. The idea is that even though there is no B outside the solenoid, quantum mechanically there 


can still be some effects. 


To understand this, we'll take a circle C of radius r > a wrapping outside the solenoid (so that B is pointing 


through the surface encompassed by C): Stokes’ law tells us that the vector potential A satisfies 
[ot-A= [48-0 A), 
and now V x Ais B inside the part of the surface that intersects the solenoid, so this is 
=fTaB=o 


(the flux through the solenoid). But the left-hand side says that we’re integrating A only along curve outside the 


solenoid, so even though B=0 everywhere along C, we still have a nonzero A. And one way we can do that is to take 
= Dm . 
A==?¢ 


(in cylindrical coordinates (r, @, Z)) for all r > a, and then the curl formula in cylindrical coordinates gives us 


and both of these terms are indeed zero. So the magnetic field is correctly zero outside the solenoid (for r > a), and 
now our job is to understand what this nonzero A does to our system. We can consider the Aharanov-Bohm phase 
4(X,C) = nal dt. A, 
fic Je 
where C is some curve from 0 to X, where we can notice that Vg(x,C) = A(X) by the fundamental theorem of 


calculus (so the derivative doesn’t depend on the choice of path C). We'll use this to simplify the structure of our 


Schrodinger equation: we need to satisfy 


d ( inV TA)° ed | U(x, 1) = in w(x, t) 
2m ec ve. aye ee 
and to solve this we'll use a change of variable 

W(X, t) = IK O>(X, t). 


We can then compute what happens when we substitute this in: the product rule gives us 


q 
c 


4 Ay, 
Cc 


. * lS : hd = 
( inV A) V = e'9®%O(_jnVx) 4 “AW 
and now we see that we substituted in this particular choice of g because it makes the last two terms cancel. So the 
equation that the x obeys is the Schrodinger equation above, except setting A = 0, so what we've done is split the 
wavefunction into two pieces — one that knows about A, and one that doesn't! 

Therefore, the impact of this vector potential is to add a phase e’9*:© to the wavefunction, and we can now see 


how to use this. 
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Example 112 
Consider a two-slit interference setup, where particles that pass through form an interference pattern on a far 


wall. We want to understand the impact of putting a solenoid of flux © between the two potential “paths” (close 


to the near wall with slits). 


We know that superposition tells us that the wavefunction of the interference pattern at a point X on the wall is 


the sum of the contributions from the two potential paths, so 
WS Wy + We = CMR p(x, t) + eM) 0(z, t) = eM) [re4(x, t) + 9% x9(z, t)] , 


where the path C can be thought of as going forward along C2 and then coming back along Cy, which is a closed 
path! So then Stokes’ law gives us 

g(%,C) = ee xo, 
In other words, changing the magnetic field will actually change the interference pattern, because it changes how the 
two wavefunctions interfere with each other! But if we change B by the right amount to change the magnitude flux 
from 2 to 6+ Do, that leaves the interference pattern unchanged. So that tells us what we can measure about & 


using physical experiments! 


24 April 1, 2021 


We'll start a new topic today, time-dependent perturbation theory. So far, we've been mostly solving problems 
where our Hamiltonian is time-independent, but there are many important problems in physics where the Hamiltonian 


is time-dependent: 


Example 113 


If we shine light at an atom, then the electromagnetic wave has electric and magnetic waves oscillating in time, 


so the Hamiltonian of the atom will be time-dependent. 


So we need to be able to address such time-dependent problems, and we'll develop some new machinery for 
addressing them. And we'll be able to use this to understand systems like decay rates in excited atoms! 


Recall that quantum mechanics is basically about solving the time-dependent Schrodinger equation 
in |b) = H(t) lv) 
a 


If the Hamiltonian H(t) = H is time-independent, then we can simplify the problem, since we can find the eigenstates 


of H and write the time-evolution as 


(w(t) = D0 cre /Ft/ In) . 


But when the Hamiltonian is time-dependent, our life is more difficult, and it doesn’t actually help very much to find 
the instantaneous eigenstates |n), of H(t), because we no longer have a simple relation like |(t)) # 5°, cn(t) |1);. 
(We will discuss a situation later in the class, where H is varying very slowly, where |n), is useful. But for general 


time-evolution, this is not a good approach.) 
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Example 114 


We will often be interested in a setup where the Hamiltonian can be written as H(t) = H© + 6H(t), where we 


know everything about the time-independent H) term, and OH(t) is our time-dependent perturbation. And we 


will often take 6H(t) to be zero outside of a time interval [to, t¢], where the perturbation is turned on. 


Then if we know the spectrum of H(®), we know the evolution of the system at times up to to and at times after 
tr (given by the stationary state evolution that we described above). If we then assume that the system is in an 
eigenstate of H©) for t < to, it makes sense to ask for the probability that it transitions to a different eigenstate at 


t= tr: 


Example 115 


An example of such a 6H(t) is a laser that is focused on a hydrogen atom from time to to ty, and perhaps we're 


interested in the probability that the hydrogen atom has made a transition from the ground state (before to) to 


some excited state (after tr). 


To understand how to deal with these kinds of situations, it’s good to first go over some formalism and talk about 
the Schrodinger and Heisenberg pictures. In the Schrodinger picture, we have states |q(t)) evolving in time, and 
the Schrodinger equation ing |v(t)) = H(t) |(t)) tells us about the evolution of the system. Our observables As 


do not evolve in time: in particular, if we want to evaluate the expectation of any observable, we have 
(As)(t) = (h(t) |Aslb(t)) . 
But for any w(t), we can define a time-evolution operator U(t), defined by 


p(t)) = U(t) |P(0)). 


(This operator U(t) should be unitary.) If the Hamiltonian is time-independent, then we have the simple form U(t) = 
e/Ht/h But more generally, the expression is more complicated. And the Heisenberg picture starts by taking some 


of the equations above and reinterpreting them: we can write 
(As)(t) = (W(t) |Aslo(t)) = (h(0)|UT(t)ASU(t)|¥(0)) , 


and thus we can always work with the wavefunction at time 0 and evolve our operators instead! In other words, we 


measure the Heisenberg operator 
Au(t) = Ul(t)AsU(t), 


and thus the operators are evolving instead of the wavefunctions. (We can then write down a Heisenberg equation of 
motion, but we won't do that now.) 
These two pictures of quantum mechanics are completely equivalent, but we can now return to our Hamiltonian 
of the form 
H(t) = H© + H(t). 


We then want to see whether we can isolate the time-dependence coming from 6H(t), and that gives us the interaction 


picture, which is basically a cross between the Schrodinger and Heisenberg pictures. 
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Definition 116 


The interaction picture wavefunction is given by 


|D(t)) = eM */" w(t) . 


The point of this definition is that if 5H(t) = 0, then all of the time-evolution is due to the time-independent H©: 
la(t)) = e MOP Iep(0)) 
and thus the exponential factors cancel: 


loCe)) = eOeMeriHOH7™ (0)) = jv(0)) 


So lvie)) doesn't change as a function of time when there Is no perturbation, so any time-evolution of our system Is 
due to 6H. And if we know |b(t)), we can also recover our original wavefunction by multiplying by e7/4t/P, 
So now if we turn on our 6H, we can derive something similar to the Schrodinger equation: notice that by the 


product rule, we have 


2 0 |. j HO iH 2 fo) 
ines W(t) = eM M(H) y(t) + "Pin (t)) 


which simplifies by the Schrodinger equation to 
= ll —H + H(t)) [w(t)) = el */P5H(t) [¥(t)) . 
Since we want an equation in terms of w(t), we can solve back for q(t) and find that this is 
= el MAG (eye HOW |h(e)) 


where if we define the operator 6H(t) = e'4#*/MGH(t)e/H#t/" we have 


ing |b(t)) = dH(t) |P(t)) |. 


Here, 6H(t) is the Heisenberg operator for 6H, with respect to H)! So indeed, we have some aspects of both the 
Schrodinger and the Heisenberg picture here, and our goal will be to develop methods that help us solve for w(t). 
(Everything is exact so far — we haven't made any assumptions about 6H being small.) 

It's also useful to discuss how these equations look in an orthonormal basis for concreteness: suppose that we 
have an orthonormal basis of H‘®) given by H() |n) = E,|n) (and we won't put (0) superscripts this time for the 


unperturbed states, just for brevity). Then any wavefunction can be expanded in the orthonormal basis, so we have 


|h(t)) = S© en(t) [n) 


n 


for some complex coefficients c,(t). Then we know that 
\~(t)) = e iH t/n |-b(t)) _ e iHOt/n S- Gat) |n) 
n 
and because |n) is an eigenstate of H ©) we just pick up a phase factor 


= » Goer |n) . 
n 
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—Ent/M from the more complicated time-dependence of c,(t). And indeed, 


So we've separated the trivial phase factor e 
we recover the usual answer for time-independent Hamiltonians if 6H = 0, and thus c,(t) are each constant in time. 
But if we want to find the cps, we can try plugging the form of w(t) into the interaction picture equation and get 


some differential equations. We find that 
wae) ~ 
in > em(t) lm) = 8H(t) > en(t) In) 
m n 
and now we can take the time-derivative inside the sum to get (relabeling m to m’ because of the next step) 


So ihem(t) |m!) = bH(t) = S> cn(t)SH(t) [n) . 


nm n 


This is a “vector equation” in the Hilbert space, so the equality must hold along each component, and we can check 


this by taking the inner product of both sides with some bra (m|. Then orthonormality on the left-hand side gives us 


iNém = S~ c(t) (m|6H(t)|n) , 


n 


and now we will introduce the notation for the matrix element 


dH mn(t) |= (m|5H(t)|n) ; 


which can be simplified in terms of the original 6H as 
_ (mleM*/M6H(t)e iH #/A| 1) _ (me iEnt/A] 5r4()] e-HEot/™) 


and now the constant phase factors come out to give us (remembering that bra has an extra complex conjugation 


term) 


= ef(En-£0)/® (mlst4(t)|n) =[ eS Hnn( t) f 


where Wmn = Fu Fe is related to the energy difference between energy levels (this is the transition frequency). So 


we now know that 


ihém = S elton 6H mal t)Calt) | 
n 


and this gives us a set of coupled first-order differential equations for the coefficients c,)(t). 


Example 117 (Nuclear Magnetic Resonance) 


Suppose that we have a spin 1/2 particle in a magnetic field, so that the Hamiltonian is given by 


@ «x B. 


Here, w has dimensions of frequency, and s has dimensions of f, so the units work out. Now if the magnetic field 


is constant, the Hamiltonian is just the time-independent H©) = —woS, = fda 2 — the eigenstates are then |z; +) 


and |z; —) with energy Peg and in general a spin vector 5 will precess around the 2-axis with frequency Wo. 


But we can now consider perturbations to this system. First of all, we can consider a small additional magnetic 


field along the x-direction, so that we have the (still time-independent) Hamiltonian 
H = (Q, 0, Wo): (Sx, Sy, Sz). 


This then corresponds to a magnetic field that is slightly tilted off of the z-axis, and thus the spin precesses around 
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this slightly tilted axis at a slightly modified frequency |d| = fue +22. (So if the tilt is small, the modification of the 
motion will also be small, and we can find the new eigenstates as well — they will be spins aligned along the tilted axis 
with new energies Ala |) The point is that this small constant perturbation doesn’t change the motion much, because 
everything is still time-independent. 

If we instead apply an additional rotating external magnetic field, though, things get more complicated. (Exper- 
imentally, this often comes from a large external magnetic field for the z-direction, plus some additional oscillating 


radiation.) And we will pick the perturbation to match the frequency of the natural precession, so that 
dH(t) =Q (Sy coswot + Sy sin(wot)) . 


We'll solve this problem by going into the interaction picture, which basically means we go into the frame of the 


unperturbed precession at frequency Wo. To understand that, notice that 
H(t) = eHO*UMGH( te fHOt/N 
can be simplified by plugging in the explicit expressions: 
= Qelt7"/2 (§, coswot + Sy sin wot) err /?, 


And e~'0te*/2 is a rotation around the z-axis, so indeed we are corotating with the unperturbed precession (and then 


the conjugation puts us back in our original frame)! Now we can use the fact that 
@lwoter/2 6 eg Wwoter/? — § coswuot — S,sinwot, 


and we'll take t + —t so that 


—twota? /2 § alwoto?/2 = 
iy = 


é §coswot + Sy sinwot. 


Therefore, we have that 


S$, efwotor/2 — eivoto’/2 (§. coswot + Sy sinwot) , 


so we can plug this back into 6H(t) to get 
bH(t) = 2S, 


which is exactly what we expect: if we go into a frame rotating at Wo, then we have a constant perturbation because 


we rotate along with the rotating magnetic field. So the interacting picture wavefunction satisfies 


which is easy to solve: we have 
[B(e)) =e 5/" [HO)) = eS*/? [O)) 
(second equality by definition of W), so that 
HB(e)) = exp (— So) joo). 


Therefore, in the interaction picture, our spin is rotating at frequency Q around the x-axis, so if we go to the Schrodinger 
picture, we have 


[(t)) = eMOUP H(t) = eH eH Fe fxh(0)). 


So a spin that starts along the z-axis will gradually tilt into the xy-plane (and keep rotating around the x-axis), while 


87 


still precessing around the z-axis at a faster rate. And the lesson here is that a constant perturbation with a small 
strength doesn't change the motion very much (because the unperturbed energy spectrum cannot supply any energy to 
the system, and it doesn’t make transitions between systems effectively), but driving the system in a time-dependent 
way can cause resonance so that the motion will change dramatically. 

We still haven't made any assumptions or approximations yet, but now we'll turn to perturbation theory, which 
is what the interaction picture is particularly good for. Just like at the beginning of the class, we'll now look at 
H(t) = H© + 6H(t) for a dimensionless parameter » € [0,1]. If we look at the interaction picture Schrodinger 


equation, we want to look for a perturbative solution which is a power series in A: 
[B(t)) = PO (e)) + [PO CE)) + 2? |H(E)) + 


where we're expanding the full time-dependent wavefunction in A in the interaction picture. To understand how we 


find these terms, we integrate the interaction picture equation to find 


[ deine, |o(t’)) = af dt'6H(t’) |b(t')) . 


Dividing through by /f and noticing that the left-hand side is an integral of a derivative, ew have 


dH(t') 
if 


Iib(t)) — |G(0)) =A i gt SD pe), 


Since |h(0)) = |(0)), we can rearrange this as 


He) <1) +a fo ae Gey) 


So we now have an integral equation, and the reason we wrote things this way is that it is convenient for a perturbative 


expansion in A (since |b(t)) appears on both sides). We'll substitute in this equation back into itself to get 


~ ,OH(t dH(t" 
HB(e)) = 19(0)) + f° oe AD to +r fi ae geen) )). 
which simplifies out as 
ay : ,OH(t') 2 : / , OH(t') dH(t"’) Tall 
= |(0)) +a / dt —— |(0)) +2 | dt fl or aaa |p(t”)) . 
And now we again have a w on the right-hand side, so we can plug in again to find 
_ : ,OH(t') 2 ‘ / , OH(t') dH(t"’) 
= (0) +2 fo ae woyy a2 fae! fae OES (H(o) +00) 


(the O(A) part gives us a A? term, which would then be a triple integral, but we can probably predict the form of 


the solution from here on). And now, we can read off the zeroth, first, and second order terms, but we'll discuss the 


structure of this more next time! 


25 April 5, 2021 (Recitation) 


We'll talk about magnetic monopoles and the interaction picture today — we'll save the perturbation theory for next 


week recitation. Recall that in electromagnetism, we know that we have V - B = 0, but V- E = 47 for some charge 


88 


distribution given by p. And this fact is encoding the idea that there are no magnetic monopoles: otherwise, Gauss’s 


[é-as= [-Bav =o 


(where the integral on the left is over a surface, and the integral over the right is over a volume). But it seems like 


divergence theorem tells us that 


we might want more symmetry in Maxwell's equations between E and B. 


Example 118 


Suppose that we modify Maxwell's equations so that we have a magnetic charge density py and V-B= 470m. 


(We can also introduce other analogous quantities to make Maxwell's equations completely symmetric in E and 
B.) 


Now if we have a magnetic monopole at a point so that 


e 


5 Ma 
B = we 
we would find only a nonzero divergence at the origin: 
V- B=68(P). 


We then want to construct a quantum mechanical A that is consistent with B = V x A. Since this problem is 


spherically symmetric, we should use spherical coordinates (r, 8,6), and it turns out one result that works is 


€m(— cos@) » 


A= b=Ag(9. ob 


rsin@ 


(we just need a component in the g direction). We then find that (looking up the formula for curl in spherical 


coordinates and only keeping the terms corresponding to Ag) 


re) ; al 6) 


And we can indeed check that this indeed gives us the PSB that we want, so this is a valid vector potential! (There 


VxA=P 


are different As that also work, but we’ll work with this one.) 
So now that we're doing quantum mechanics, we can imagine substituting this A into the Hamiltonian. But we 
have a problem — as 0 > 0 or @ > 7, this potential A diverges. If we instead think about 


€m(1 — cos) » 


A — 
: rsin@ 


this solution also satisfies V x A = B, and as @ - 0 this solution no longer diverges. But this solution still doesn't 
work for 6 = m7 — we can also instead get a solution that doesn’t diverge at 7 if we write down 


= €m(—1 — cos@) » 
1S ena 


rsin@ 


basically replacing the 1 with the —1, but this time our solution will diverge at 0. So in other words, we can get A to 


be fine for all 0 << m—e orall @ >, but not both at the same time. 
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Proposition 119 


Because what physically matters is B rather than A, the idea is to patch together our two solutions from the 


different As to describe B in both cases. And in this way, we are able to cut out the singularities and start doing 


quantum mechanics. 


The idea is that because we have two different As that both give us the same B, we need things to agree in the 


overlapping region. In other words, we should be able to write the difference 


in terms of a pure gauge transformation VA. And because the gradient in spherical coordinates for the @ term is 
_¢ 8 
rsin@ 0d 

So we'll now understand what this magnetic monopole implies for the quantum mechanics: suppose that we have 


A, we find that A = —2em@ gives us what we want. 


an electron of charge q in this potential A. The two different As will give us two wavefunctions that are related by a 


U = exp (fA) term: 
w exp (Ane) 7 
n= ace. ne Ps 
fic 


If we look at the “overlap” region, for example at 6 = 4 (in the xy-plane), then we get a condition because we know 


that the wavefunction for the electron needs to be single-valued. In other words, 


Wi(r,0,6+2r)=(r,9,6), Wil 6,o+ 2m) = wi (1,4, 9), 


and then substituting in the U term gives us a condition 


2d€m 


Z|. 
fic = 


In other words, we have quantization of our magnetic monopoles — if g comes in discrete amounts, then that gives us 
a one-to-one correspondence with the quantization of the magnetic charge! 
We'll finish this recitation with some discussion of the interaction picture: recall that we started time-dependent 


perturbation theory last lecture, where we have a Hamiltonian of the form 
H(t) = Ho + 6H(t). 


Note that these two pieces of the Hamiltonian both generate time-dependence for our wavefunction — for an energy 
eigenstate, we get a phase even if the energy is constant. And the point of the interaction picture is to separate out 


the phase contribution from the other time-dependence: specifically, we cancel out the Hp phase contribution via 
[P(t)) = elt!" fap(t)) 
We then get the interaction picture Schrodinger equation 
 d,~ ~ ~ 
in, |D(t)) = oH(t) [¥(2)) 


for the interaction picture operator 5H(t) = e!*/9§H(t)e~/t/"_ And this is similar to the Heisenberg picture, in 


iHt/h 


which we conjugate by e instead so that the states are completely fixed — here, we're only moving some of the 


time-dependence to the operators, not all of it. 
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So if we take a basis of the Hp unperturbed Hamiltonian satisfying Ho |n) = E,|n), then recall that we have 
(m|5H(t)|n) = eS Hina (t). 


And the idea from here is to look at the representation of our interaction picture wavefunction in terms of the 


unperturbed basis: we write 


|P) (t) = So en(t) In), 


n 
where the c,(t) coefficients basically only encode time dependence due to 6H. And recalling that the probability of 


observing a state |m) at time t is 


Pry(t) = Kmlv(t))|? =| mlei**/*|ep)]° = |(m]d(2)) 


(in the last step, we have the e~!Mot/h act on the bra, and then the phase cancels and doesn’t affect the squared 
magnitude). And this last expression is exactly |¢m(t)|*, so if we can find those cm(t) coefficients, that’s often the 
most relevant thing we want to find. 


We then get a Schrodinger equation for c given by 


iMém(t) = S— cn( tle" FH a(t), 


n 


and we'll see an example of this how to use this on Wednesday where we don’t need perturbation theory! (Basically, 


we'll be looking at Rabi oscillations, where we have two-state systems and a solvable Hamiltonian of the form 


E iwt 
: af sn) =| oe. ae | 
ye wt 


H, = 
oF ion - yes 0 


And the point will be to understand how this time-dependent phase affects the evolution of the system.) 


26 April 6, 2021 


Our paper draft will be due on Wednesday, April 21 — if we haven't heard back from our writing assistant, we should 
email the course staff. And once our paper is in a reasonable form, we should exchange drafts with our peer editor 
and make comments! 

Last time, we started discussing time-dependent perturbation theory for Hamiltonians of the form H(t) = HO 4 
5H(t), where H® doesn’t depend on time and the perturbation 5H(t) is often only turned on during a time interval 
[to, tr]. It then makes sense to ask for the probability of starting in a particular eigenstate before to and trying to 
understand the different probabilities of ending up in other eigenstates after tr. 

We separate the trivial time-dependence from H©) from the nontrivial time-dependence from dH(t) using the 
interaction picture: if we define |b(t)) = elH@t/n |y(t)), then we get a Schrodinger-like equation ing |b(t)) = 
dH(t) |o(t)), but instead of the Hamiltonian, we have the operator 


5H(t) = OMG H(t) HHO HA, 


At the end of last lecture, we started treating 6H as a perturbation, and (just like in the first class of 8.06) introduce 


a dimensionless parameter A which we use for expansion in a power series 


|P(t)) = o(e)) + 1 (t)) +2 }o(t)) Tevet 
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We found that the zeroth order term for |P(t)) is just the constant |w(0)), the first order term is the integral 


: to 8H? 
WWD) = [ae AE hwo), 
0 ih 
and then higher-order terms have a “nesting” of integrals: 


BC) = [ae fo der HHO yoy, 


Basically, everything always acts on the wavefunction |w(0)) at time zero, and then we integrate over all times 


0<t"” <t' <t, and having the operators 6H at times t” and then at t’ act on our wavefunction. 


Fact 120 


We can think of time as progressing forward (from 0 to t), and we always have 5H at an earlier time acting before 


6H at a later time. That’s why we often talk about time-ordered operators, and this is important because the 


operators at different times do not usually commute. 


(The analogous expression for |b)(t)) has n nested integrals, looking over all possible 0 < th < th-1 <-+: < 
t, < t and having the 5Hs at these different times act on |(0)). But we won't need it for what we’re talking about.) 
If we write this expression in basis form, where we're using the eigenstates |n) of the unperturbed Hamiltonian, 


remember that we get an equation 


|B(t)) = So en(t) In), 


n 
and we can expand out each term of the power series for |P(t)) in this same way as 
Ce) = D> che) In, 
n 


where the superscript k refers to the order in perturbation theory. If we plug this into the power series, we find that 


each Cy is then also a power series: 


Ch eee 
k 


And now we can start doing order-by-order calculations, equating the two forms for |b (t)).For order 0, since 
|p (t)) = |p(0)), we know that c(t) = c¢,(0) is just a constant. Next, for order 1, we know that 


We) = [ae joc) = POI. 


n 


and |w(0)) can be rewritten as }*,, c,(0)|n), so that we have 


[ae SS enlo) im) = So che) In. 


n 


Since this is a vector equality, we put a bra (m| on both sides, and we find (by orthogonality on the left and moving 


integrals and sums outside the inner product) 


c(t) - | FE n(0) (m|sH(t')|0). 
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The matrix on the right is defined to be 5Hmn(t’), so that we get the equality 


c(t) = a ap ocimnke sf se, Cn(0). 


(we assume everything converges so we can freely swap sums and integrals). And now because 6Hmn and dH mp are 


just related by a phase e™(t), where Wmn = eae we can write down our answer: 


Proposition 121 


We have the first-order correction to our coefficients 


eee Odea te 
c(t) = yi dtm a. ) (0). 
n 


So we can now calculate the probability of being in eigenstate m at time t, which is 


P,,(t) = |(mlb(t))|2 = |(mle™*/M| pct) | 


iHt/h 


(trying to write things in terms of the interaction picture). Again we have the e— act on the bra, which gives us 


a phase (because |m) is an eigenstate of H(), and that phase goes away since we're looking at the squared magnitude. 


So then we have 
Pm(t) = |(m|eb(t))|° = lem(t)?. 
And since we have a perturbative expansion for this, we can write it out as 
= |e) + (4 --f 


(we're going to set = 1 from now on). 


Example 122 


As described above, suppose that at time 0, our system is in eigenstate |n) — we wish to find the probability (to 


first order) Pyen(t) that the state will be in state |m) at time ft. 


In other words, we're told that |h(0)) = |n), so we have coefficients cg(0) = den (the only component we have 
at the beginning is along the |n)-direction), and thus the zeroth-order coefficients are simple: c(t) = Cm(0) = dmn. 


For the first-order correction c(t), notice that we’re summing 
1 OH me(t 
coy = 3 ff aterm intl) 6(0), 


and all of the c(0) vanish except 2 = n, so 


1. fe , 
c(t) = =f a t! eiwmnt OH alt’). 
ih Jo 
So now when we look at the probability 


2 
Pnen(t) = |Smn + c(t) + 2 (t) +--+ 
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there's two cases: when m ¥ n, the leading term is given by 


CP ei tip : 
Pneolt) =le@(2))? =| sa | fo atielm’Hmn()] + OCH)? 
0 


(since the c(t) coefficient is first-order in 6H, its square is second-order in 6H, and the largest corrections come 


from cross-terms between the first and second order corrections). But for m =n, we then find 
Pne-n(t) = |L-+ c(t) + op (e)P = 1+ cn? (t) + cp) Fo CL GCE) + GCE +) 
and then expand order-by-order to find 


= 14 (M(t) + (ty) + (1? + h(t) + cP (H)*) + OL). 


It turns out that (ao + cf (1)*) = 0, because 6Hpp(t’) is always real, Wn = 0, and thus we have a real integral 


divided by /f, meaning that c(t) is purely imaginary. So our first-order term vanishes, and then the next term 
(IP + c?)(t) + Pty") seems to have terms c(t) that we haven't calculated yet, so it seems like we might 


be stuck. But instead, we can just calculate 


Phen(t) =1- S- Pmen(t), 


mén 


since we know each of those probabilities to order (6H)?, and that gives us 


2 


1 : iWmnt! 3 
Puatt) = acne i dt’ elm" 6H nn(t’)| + O(5H)? |. 


mén 


And noticing that Ic) (£)|2 in the expression we tried originally is positive, so the contributions from c?)(t) and 


cl?) (t)* are important to make sure the probability P,—,(t) is less than 1! 


Remark 123. Notice that the issue with normalization factors from time-independent perturbation theory does not 
arise in time-dependent perturbation theory, because using the interaction picture keeps our states normalized. So as 
long as we work consistently to some order k, there will not be any need to renormalize the wavefunction, since our 


expressions for the perturbed eigenstates will always be normalized. 


Example 124 
Consider the case where 6H = V is time-independent, so that we have a constant perturbation, turned on from 


time O until some time tp. The perturbation can also be on outside of this interval, but the point is that we start 


in an eigenstate |/) of the original Hamiltonian H© at time t = 0 and want to find the probability we're in some 


eigenstate |f) at time t = to. 


We could use time-independent perturbation theory for this by finding eigenstates of H() + V and evolving each 
of those separately from time O to to. But it turns out to be more convenient to use the time-dependent formulation: 
our goal is to find (for all f 4 /) 


Pre-i(to) © [ck (to) |?, 
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where we know the expression for the necessary coefficient: 


ci (to) = a dt! = elit yy, 


Now V,; is a time-independent term, and the remaining integral is the integral of an exponential: 
_ Vai elie’) |” 
= ii IWF; 


= Wei _ plwryito 
Re et (1— efit) | 


= 


Vei jt 
~igenen(-am(“) 


because the phase terms go away when we take the squared magnitude: 


4\V_;|? t 
Pre i(to) © Veil ane (“s 2) | 


which we can further simplify as 


(Ep Ei) 2 


There are two main cases to consider here: we can make transitions to other energies of H‘°) (because the Hamiltonian 
changes when we turn on the perturbation, so energy doesn’t need to be conserved), meaning E¢ # Ej, or we could 
end up in states with the same energy, meaning E; = F;. (The subtlety here is that during the range from 0 to to, 
we have energy conservation if we’re in an energy eigenstate of H) + V, but we can’t say anything about H(© in 


general.) 


* When Er 4 Ej, we see that the probability of making a transition is proportional to sin? (2e%), and it oscillates 


2 
with period a and amplitude ae But remember that perturbation theory is only valid when this amplitude 


(the probability of making a transition) is small, meaning that we must have veil? > < 1. But this kind of 
[Er Ej 

expression should look familiar from time-independent perturbation theory — It is in fact the same criterion! And 

because we're doing a constant (time-independent) perturbation in this example, we shouldn't be too surprised 


that the validity criterion is the same. 


But as long as this criterion is satisfied, perturbation theory remains valid at all times, which won't be true in 


the next case: 


«+ When Er = E;, the expression E¢ — E; in the denominator does not make sense: we can go back to the integral 
and do it properly, or we can take the limit Er — E; in our answer. The two procedures give the same answer, 
so we'll do the limit here, and we find that (expanding sin x = x) 


p®) (to) © A\V¢i|? ; Wr ito = [Veil]? t3 
Bie Epa) Ne eo 


So the probability of making a transition actually grows quadratically with to, instead of oscillating like it did in 
the first case! So perturbation theory will break down here at large times because the probability must always be 
at most 1, but it will always work for sufficiently small times. (And this lines up with the fact that degeneracies 
require us to be more careful in time-independent perturbation theory.) In fact, the quadratic curve will be a 


good approximation of the sin? expression from the first case for small to. 
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Example 125 


Next, consider harmonic perturbations of the form 


2H’ coswt O0<t< ty 


dH(t) = 5 
0 otherwise 


where H’ is a time-independent operator. 


(A common example of this kind of Hamiltonian is where we shine electromagnetic radiation on a system, since 
the electric and magnetic fields oscillate.) We again care about the probability of transitioning to a final state |f) if 


we start in |/), recalling that the formula is 


1 se net, 
ce!) (to) = al dt'e* F(t’). 
0 


Plugging in the form of 6H that we have here, this becomes 


1 f* ‘eaest! 
== dt’el"™ 2 cos(wt')H¢;, 
0 


where H;, is time-independent so it can be taken out of the integral. We can also write the cosine as a sum of two 


complex exponentials, so that this becomes 


H.. st . 
= =e f dt’ (une ae giee-wl) 
ih Jo 


and now we can just evaluate the integral: we find that 


L(t) _ Hi. el(wri—w) to _ | el(writw)to _ 1 
f Wri — W Wj + Ww 


h 


This time, the special cases are the regimes where w is close to tw; — in those cases, the transition probability will 


be enhanced. 


° If wW & wry;, that basically means the perturbation is tuned to gives us enough drive to make a transition up by 
fiw (we almost have resonance in the system). In particular, if |w — wr;| < w, then the first term of oh) (ty) 


dominates, and the probability of a transition is (remembering to take the square) 


4|HI.|2 sin2 ((a52*) 
Pre_i(to) = t 
Fi ( 0) h2 (we; _ w)? 
So again we see the probability oscillating, with period weal and amplitude ee ae And if we are exactly 


on resonance, we get the quadratic behavior that we saw in the constant case also, and this is the case of 


absorption where the system energy very effectively. 


* On the other hand, when w ~ —we;, the system undergoes stimulated emission, where E; ~ E; — fw (so the 
system is tuned for a downward transition). We'll talk more about this later on when discussing interactions of 


atoms with light! 
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27 April 7, 2021 (Recitation) 


Today, we'll work out the example that we mentioned at the end of last recitation as an example of a solvable 


time-dependent Hamiltonian: 


Example 126 
Suppose that we have a Hamiltonian given by H(t) = Ho + 6H(t) for 


5H(t) = 


For this particular problem, we have two time-dependent coefficeints cy(t) and co(t), and the equation that we 


must satisfy is 


iney(t) = S— ca(te”"*6Hmn(t), 


n 


and the only terms with a nonzero 6H, are the off-diagonals, and thus the actual equation is 


ihc, (t) = co(t)e 2 *yel* = yon(t) el wa) |. 


(where recall that wo; = ae = —W 2, SO we have an extra negative sign in the exponential). Similarly, the equation 


for the other variable is 


ihéo(t) = cr (tel tye"? = yo, (t)e wa) |. 


Again, these are exact differential equations, and we can actually solve the problem exactly for this two-state system 
(we'll think about the perturbative solution soon). 
Suppose that we set up the initial conditions so that c1(0) = 1, co(0) = 0 (meaning that we're in state 1 at the 


start). We can combine the two boxed equations by taking another derivative: by the product rule, 
ihéa(t) = yer(t)e@-2)* — iw — wor yea (t)e Ye, 


and now we can plug in our boxed equations above to simplify this to an equation just involving co: 


2 
ine = Ate ecm) > ete a502 =, 
Since this differential equation has constant coefficients, we can try co = e** and we get 


a 


me O° 


dM + i(w — Wo1)A + 


(since the exponential cancels in all three terms), and by the quadratic formula we have 


.{ 1 ye (eta) i 
a=i(-F(w wn) +2), a=(F4 i ; 


So our general solution looks like (either writing as exponentials or sines and cosines) 
C2(t) = e/a [Aleit 4 Breit) — eH 1/2 wat TA cos(Qt) + Bsin(Qt)]. 
(And once we know the solution for c2, we also know the one for c,, but we'll just work with the former.) Plugging 


in the initial condition (0) = 0 tells us that A = 0, and plugging in c,(0) = 1 = + & = > (from our second boxed 
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equation above) tells us that 
emt! 
AQ” 
(the term where we differentiate the phase instead of the cosine is 0 at t = 0). In other words, we now have the 


QB cos(0) =s = B 


solution 


(t) = — heen} sin( Qt) 


This tells us that the transition probability from 1 to 2 (from time 0 to f) is 


ye 1/2 
Poea(t) = |ea(t)[? = —E ap sin? ([% rece vay") ) | 


Re 
2 (w—wo1)2 
r a ? 


and then if we want the probability of staying in state 1, we can just use |c,(t)|? + |c2(t)|? = 1. What's interesting 
about this result is that there are two intrinsic frequencies in the problem, and we can notice that the amplitude of 
the probability is largest at the resonant condition w = W2, where the two frequencies are tuned to be equal. (The 
width at which the amplitude becomes smaller than $ is 2) 

Fact 127 

Because sin? oscillates between 0 and its maximum amplitude, the probabilities of transitions will keep changing, 


and this is related to a phenomenon for the ammonia molecule that we might remember from 8.05. 


Example 128 


But now we can think about this problem from the perturbation theory perspective, so that 6Hy. = ye! is small. 


5 5 A 6 5 5 2 
This means we will effectively want to do a power series expansion in y, and we want to make sure Game = il. 


If we approximate our exact result above with this limiting situation, we find that 


en ee ee ee . 


(w — Wo1)? 2 


Let’s see if we arrive at the same answer through perturbation theory! The first-order correction to the coefficient 


C2(t) is (according to lecture) 


t IWo1t 
M(t) = fav Ha(e)ex(0), 


ih 
Plugging in what we know, this simplifies by integration to 
a eee 1] 7 Y___giluan-wtl2 . pi sin (wat) | 
if i(Wo1 — w) fi(wo1 — w) 2 


And indeed, if we square this result (meaning the phases go away), we indeed get the same answer as we expect. 
More generally, returning more to the material we've been covering in lecture, remember that we're doing a 


perturbative expansion of the form 
en(t) = on(0) + M(t) 4 OL 4---, 


where we have the relation 


© dee Olina e 
Cvri= >| dt! eluant’ mnt) rink ) «,(0). 
n 
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This can be compared to the differential equation ifiém = >>, Cn(t)e™’dHimn(t), and we're basically solving this 
differential equation perturbatively above by successively getting more integrals. Drawing more parallels with our 
example above, we have the initial condition for the kth derivative Y(t = 0) = 0 for any positive k, and if we want 


to calculate the transition probability P»_;(t), we compute c(t) |? with the initial condition c,(0) = dny. 


Example 129 (Constant perturbation) 
Consider a one-dimensional square well of length a which is usually an infinite well (V(x) = 0 for 0 < x < aand 


V(x) = co otherwise). But from time 0 < t < to, we add a constant perturbation to the well so that 


Van Ol xa7a//2 


Vixl—= <0 “ajax <a 


oo. otherwise. 


In other words, we raise half of the square well from time 0 to tg, and we've derived using first-order perturbation 


theory in lecture that the transition probability is 


AVei|? Wet 
Pe_i(to) = es sin? ( 2) ; 
1 


For example, if we start off in the ground state |/) = |1), we can calculate the probability that we are in state |f) = |2) 


at some later time t, and what we care about is the matrix element 


ele 2 2 
Vor = (2\V|1) =i. dxVo~ sin (=) sin (=) 
5 a a a 


(since we know the wavefunctions for the square well). And plugging this back in gives us the answer for the transition 


2n (-1)"/2 
1—n? 7 


probability without too much work! Generally, we find that (n|V|1) = Vo when n is even and 0 when n is 
odd, and we'll see that transitions to higher energy levels n fall off as - (But the assumption we do need to make for 
perturbation theory to be valid is that this coefficient is small to begin with, so Vo must be smaller than the ground 


state energy F.) 


28 April 8, 2021 


Last time, we started discussing some applications of time-dependent perturbation theory, concentrating on two kinds 
of perturbations (the cases where 6H(t) = V doesn’t depend on time, and where 6H(t) = 2H’ cos(wt) oscillates 
harmonically). We cared about the value of Pr.j;(to), the probability of transitioning from state / to f in the time 
interval [O, to], and the answers were that 


[Vril? 4sin® (5%) 


2 


Prei(to) © 7 rf 
fi 


for a constant perturbation (in which the probability is enhanced for wr; small, meaning that the difference between 


energies is small), and 


- 2 ( je Jt 
Asin (428 = 2) 


a 2 
Pr_i(to) es |H¢;| (we; + w)? 


for a harmonic perturbation where the energy difference is Es ~ EF; + fw, respectively (and in which we see the 


probability is enhanced for w = w¢;, leading to resonance). 
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Today, our next step is to start thinking about transitions to a continuum of states. We've been thinking about 
the transition probabilities for a fixed initial and final state |/) and |f), and we've been treating these as part of a 
discrete set of states. But there are two main situations that we might consider where we have a continuum of states 


instead: 


Example 130 


Recall that the hydrogen atom Hamiltonian has discrete bound states for E < 0 and continuum scattering states 


for E > 0. We may ask for the probability of transitioning from a bound state |/) to some state in the continuum: 


for example, if we shine light on a hydrogen atom, we want the probability that after some time, the electron 


becomes ejected and free. 


Example 131 
Another situation which comes up is where the spectrum of discrete states is embedded within the continuum of 
other states (for example, this comes up in autoionization). Then we may again want the probability that we go 


from a discrete state to the continuum. 


The key is that we will not be asking about the probability of being in a particular state of the continuum, only 
the entire continuum as a whole, and the perturbations we'll be analyzing will be again those where dH(t) = V or 


OH(t) = 2H’ cos(wt) (constant or harmonic). Motivated by the calculations we did last lecture, we expect most 


transitions to happen where E+ % E; for a constant perturbation, and where E¢ = EF; + fw (absorption or emission). 
And we'll find in both cases that the probability will grow linear in time, meaning that we can associate a finite 
transition rate (which was not something we could do in the settings before). 

Before we get started with this calculation, we need a technical ingredient — we need to be able to describe our 
states in the continuum. The way that we'll do this is to restrict ourselves to a discretization first — we place our 
system in a cubic box of size L. For a situation like the hydrogen atom, we know that the discrete part of the 
spectrum will be largely unaffected if L is large enough, but the continuous part will become discrete (because we 
have a particle-in-a-box situation), with spacing between those discrete energy levels decreasing as L — oo. So we will 
begin with a modeling by this set of discrete levels, and taking the limit later lets us check that L indeed drops out of 


all relevant physical quantities. 


Definition 132 
The density of states o(F£) is defined so that 


number of states dn in energy interval dE around E = p(E)dE. 


This is a useful quantity because we need to be able to take sums over this continuous part of the spectrum 
(since we want, for example, to add up the probabilities that we make a transition from our initial state to any of the 
continuum states), and the density of states helps us change this sum to an integral. Although p(E) as defined does 


depend on the size of our box, we will see that the L-dependence will cancel out eventually. 
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Example 133 


* 4 5 5 4 2. a 
Consider a free particle in three dimensions, so that Hyp = = so that our eigenstates are plane waves 


Wx, y, z) = el lkxxtkyy+kez) | 


2 k2 
2m * 


with energy E = 


If the particle is completely free, ky, ky, kz can be any real number, so the energy spectrum is indeed continuous. 
But now let's place the system in a box of length L: it turns out that the physics of the system in the middle of the box 
will be basically not dependent on the boundary of the box, so we should be able to use whatever boundary conditions 


are convenient for us. And the easiest ones to work with are often the periodic boundary conditions where 


V(xy.Z)=Vx+Ly.z)=Woayt+l.z)=Wy.z+L) 


ikl ikzL 


for all x,y,z. Plugging in our plane wave solutions, we find that e elkyL e 1, so that we must have 


k= oT ny for some integer n, and similar for ky and kz. So all states in our box will be labeled by these three integers 


(nx, Ny, Mz), and our normalized states in a box will look like 


1 201 ; 
is exp 7 (nx + nyy+nzz)}. 


We can now compute the density of states: we can look in momentum space at (kx, Ky, kz), in which our states occupy 


V(x%Y,Z) = 


a cubic lattice with spacing on If we now think about a box of fixed side lengths dk,, dk,, dk, in momentum space, 


the number of states inside that box will be roughly 


dkedkydk, L3g 
Gy Gre 


This is indeed a dimensionless number, because k has units of inverse length, and it diverges as L — oo (because our 
spacing continues to get smaller). And now we need to convert between energy and momentum, since we care about 


dn= p(E)dE. But because we have the energy relation 


h2k2 
E= ; 


2m 


we know that the contours of constant energy are spheres in momentum space of fixed Ik, so that states in an energy 
interval [E, F + dE] live in a spherical shell which has some width dk in momentum space, where we can relate dE 
and dk via 
fe 
dE = —kdk. 
m 
So putting everything together, we can divide our spherical shell into small boxes in momentum space, so that the 


number of states in the shell will then be ‘ 


(27)°" 


We can convert to spherical coordinates (since the integrand is spherically symmetric) so that our box now has some 


dn = d?k 


radial width dk and some solid angle dQ = sin@d@d@. So the volume element is 


d?k = k*dkdQ, 
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meaning that we can substitute in 


ae Exe L \? mk 
dn= (=) k2dkdQ= (=) k(kdk)dQ = (=) ae IQIIE, 


and now the boxed part looks exactly the density of states, except that we also have a “per solid angle” factor that 
will be useful for later calculations! And we can write our answer for p(E) per solid angle explicitly in terms of E by 
substituting back in with k = 276. 

Example 134 

Let's now consider the case where we have a constant perturbation 6H(t) = V, our system starts at a discrete level 


|), and this discrete level overlaps with the continuum of states labeled by |f) (where f labels over all continuum 


states). We want to find the probability that we transition from |/) to some |f). 


The total transition probability here can be written as 
Prrans(to) = S- Prei(to). 
f 


Since our setup here is to discretize our continuum spectrum, we can use the perturbation theory equation that 


we had above for the transition from some |/) to |f) under a constant perturbation! Substituting this in, we find that 


V;,|2 4 sin? Writo V-;|2 
Perans( to) — S- | fn al 2 = x | fi F (wei, to), 
f fi 


where we should again remember that wy; = ae Ve; is the matrix element between |/) and |f), and we define 


+2 
F(w,t) = ee And now the density of states comes in: we approximate the sum over f with an integral 


= [Veil 

= dErp(Er) 5 F(wri, to), 
where we're making the assumption that |V;;|2 doesn't change too much between nearby states so that we can actually 
evaluate this integral. And now at a fixed time t, we know that the function F(w, t) has a central peak around w = 0 


and some other peaks with amplitude decaying as s: 


Furthermore, this function becomes narrower and narrower at time t and thus becomes more and more peaked. So 


the point is that we're integrating |Vf;|?o(E-), some smooth function of energy, against this sharply peaked function 


F (45%, to) centered at E;, so we can approximate |V¢;|?(E) with its value around &;, and thus the transition 


probability for large enough tp can be written as 


Veil? (Ej Er —E 
nye ete) i) dE-F (= 2, ty). 
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Fact 135 


The way to understand the expression V;; in the equation above when taken out of the integral is to think about 
Vei = (FIV!) 


evaluated at the energy E¢ = Fj. It's possible that the final states f may be labeled by both energy and other 


variables, so that this matrix element may depend on which of the state |f) we choose at energy E-. So in those 


situations, we define the rate of transition for each specification of those variables, and then we can integrate over 


all specifications to get the total rate. 


This integral is now easier to evaluate: changing variables, we can write the integral part as 


oe) iA2 ( writo 

sin? (weit 
/ d(fiws;)F (we), to) = pf du AD 
—oo 1 


and now if we make another change of variables u = wate we end up with 


ee 2 sin? u we sin? u 
af unre ano f au i 


and this integral is just a constant which turns out to be m (we can do this by contour integration, for example). So 


notice that this is linear in to (which we could have anticipated because F(w, t) has largest peak proportional to t? 


and width proportional to +), and substituting this back into our transition probability gives us 


Qn 
Prrans(to) = =, Vail’ 0(Ei)to, 


and indeed, the probability of making a transition into one of the continuum state grows linearly with time! And this 


gives us the transition rate 


2n 
w= = |Veil*0( Ei) 


and this is known as Fermi’s golden rule. (What's interesting is that even though the different transition rates to 


states |f) oscillate in time, adding them all up gives us finite transition rate.) 


Remark 136. We can check that the dimensions of w are indeed inverse time (because V;; has units of energy, so 
\Vei|70(E;) has units of energy as well, and then dividing by h gives us the correct units. And we can also notice that 


the transitions are (increasingly narrowly) occurring mostly to states where E¢ = E;, specifically in the range 


2amh 2Eeee 2namhi 


E;— 
0) to 


for some constant m (which basically tells us how many “lobes” of the F function we want to include). In other words, 


the energy becomes “more and more conserved” as time goes on! 


But we also need to discuss the conditions under which this expression actually applies: we've already mentioned 
that we need to to be large enough so that we can approximate \Veil? (Er) by its value at E;. But we also need to to 
be short enough so that the probability of making a transition stays small, since time-dependent perturbation theory 


only applies if the wavefunction doesn’t change much from its initial value. We'll discuss this more next time! 
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29 April 12, 2021 (Recitation) 


We'll continue our discussion from last time, looking at harmonic time-dependent perturbations. First, a review: in 


first-order time-dependent perturbation theory, we want to solve for the mth state coefficient 
1. So at 
c(t) = af dt’ elmit 6H ni(t’) 
0 


which tells us about the (approximate) probability of transitioning from our initial state / to a final state m (this is like 
substituting in cj)(0) = 1 and c,(0) = 0 for all n 4 /), since P(t) = Ic (2) 2 to leading order. In the special case 
where we have a harmonic perturbation, we think about a particular kind of time-dependence 6H = 2H’ cos(wt)6(t) 
(the step function 6(t) meaning that we start the perturbation at 0), and we normalize it in this way because it makes 
it easier to substitute 2cos x = e/* + e~* in terms of exponentials. 

Generically, both of those exponentials come into the picture, but we like to study the special cases where one term 
dominates over the other, meaning that the w or —w balances out with the Wp; that is already in the exponential. We 
then found that if w & Wm; > 0, more specifically |w — Wmil < |wWmi|, we had (from here, we're switching to using f 
instead of m to match lecture notation) 


AIH; |? t 
Prei(ws to) = SS sin” (wn . v2) , 


We can think of this as an “absorption” situation, in which our state jumps up from |/) to |f) by an energy of 
E- —E; = hw¢;, because our incoming perturbation has frequency w = we;. (So this is kind of an “energy conservation,” 
where the energy fiw Is being taken out of the applied perturbation field and into our system. We don't actually have 
energy conservation here because our time-dependent field is inputting energ, though.y) 

On the other hand, we also have a case where Ww & —w;; > 0, and then we have 


A|Ht |? t 
Prei(w, to) = omtnaye sin” CG +w)2) _ 


This is known as “stimulated emission,” and the idea is that the perturbation field causes our system to release energy 


externally. 


Example 137 


Consider an electromagnetic wave (a solution to Maxwell’s equations with oscillating electric and magnetic fields) 


where we have E = EZ cos(wt—ky)@(t). (This is a wave polarized in the 2 direction, traveling in the 2 direction, 


and we should remember that 2 = c because waves travel at the speed of light.) 


We consider a hydrogen atom at the origin, in the ground state 1s = |1,0,0) at times t < 0, and we want to find 


the probability of transitioning from 1s to 2p at time to. The approximation that is good to make here is that 


2m 2c 
Aatom ~ao< Awave = ys = 
WwW 


It turns out that the frequencies that are most likely to cause a transition do always satisfy this inequality. So we'll 
make that simplification, and it makes it a bit simpler to simplify the applied electric field in this case. 

It does look like things are harmonically oscillating in this problem, but instead of a coswt, we have a cos(wt — ky) 
(meaning that we have a traveling wave). But the point is that our atom doesn’t really see the spatial extent of this 


wave — it only sees a y-patch of length Astom centered at a single point. So in other words, we can just substitute 
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y =O into our electric field and find that 
E = Eo2cos(wt — ky)@(t) = Eo2cos(wt)@(t). 


To get this into a perturbation, we need to write things in terms of the electromagnetic potentials, and here we can 
solve E=-VO => O=-7-F (we should check that this function © does indeed give us back our electric field), 


so then we find that the change to our Hamiltonian is 
6H(t) = qo = —qr- E = qEozcos(wt)@(t). 


So the coefficient we care about is H’ = — 1602 (the extra 2 because our perturbation should be 2H’ cos(wt), and now 


we go back to our formula above: we Just need to find the matrix element between 1s and 2p 


Ei 
"= (21m|H"|100) = = (21m|z|100) , 
and the only nonzero matrix element is where m = 0 here, because Lz and z commute and thus (21m|[L-, z]|100) = 0. 
(Alternatively, the matrix elements must all have even exponents of each of x,y,z, and we get a z from m = 1 but 


none from m= 0.) It turns out that this evaluates to 


qEo 


180/2 
5 


243 


dm0 a0 


so plugging everything back in gives us our answer for the probability of going from the 1s to the 2p state 


AP. to 
P; to) = a  __ sin2 Wr —W)— 
15—2p( 0) A2 (wei _ w)2 ( fi ) 9 , 
where this formula applies when w & ou = an (This is like “absorbing fw of energy to make a transition upwards 


in our hydrogen atom.) And we get a similar expression for the transition probability from 2s to 1p, and everything 
would stay the same except that we use wr; + w instead of Ww»; — w. And we should remember that in either case, our 
formulas are only valid when |w — wr;| < |we;| (otherwise we'd need to include contributions from both exponential 


terms and that gives us some cross-terms). 


Fact 138 


We may notice that our atom gained angular momentum when going from 1s to 2p, but that’s because “the 


electromagnetic field also has angular momentum” — photons have spin 1. 


We can also tweak our problem setup a little bit: 


Example 139 


Now suppose that we replace the hydrogen atom by a 3-dimensional harmonic oscillator, which has Hamiltonian 
252 


a ee ne ol 
Ho = 5m + 3Mwg x”. 


h 


We again make the approximation where the typical wavelength of the oscillator satisfies Ao = an 


< Awave: 
and we have energies Ey = (ny + ny +nz+ 3) hwo. A lot of calculations like before look similar, but we have a different 


matrix element 


(nny nz|z| nr, nf) , 


IT 
bh 


where z~ a, + al. Then the selection rules tell us that we must have ny = m,, ny = Ny. and n,= nd 
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Example 140 


Suppose that we have our hydrogen atom again, but we now replace our electric field to be a decaying exponential 


pulse E(t) = Eg2e~7*0(t). 


This time, our perturbation will look like 
6H(t) = —qEoze™, 


and this is neither a constant perturbation nor a harmonic one. So we'll need to redo the time integral 


t = 1802 
0 


Cis 2p if 


(the matrix element stays the same as before), and then the remaining time-integral is an easily evaluated integral 


—— (eliwar-v) to — 1). Squaring the result gives us the transition probability 
1 
Pis-+2p(to) = |et2-,2p(to)/*, 


but we can do things easily in the case where ty —> oo: the e21—7)% term always goes to 0 because y > 0, and thus 


we have 


2 
Buse gE yao Sax? 1 

ls>2p\ 40) — Ah We, a 2 
Indeed, if y — oo, we see that the transition probability goes to zero, which makes sense — having an infinitesimally 


short pulse basically turns off the electric field. 


30 April 13, 2021 


Last time, we discussed Fermi’s golden rule, which basically told us that when we want to make a transition into a 


continuum of states, we have (after some initial transient time) a constant transition rate 


27 
Prrans( to) = Who, w= —|Vril*o( Er) 
Es=E; 


We're now going to understand when this is a valid expression, and we will do so first by going back to the derivation. 
Recall that we discretized our system, wrote a transition probability over a sum of states, and then converted this to 
an integral 


|Vei|? 
dErp(Er) 5 F (wi, to), 


where F is a sharply-peaked function which becomes more and more like a delta function as to gets large. The 


2 
approximation we made was that our F is thin enough that p( Es) Vi iS approximately constant over the relevant part 


of the integral, and we'll now make that more precise. If we define the band width A to be the energy range around 
E; so that |V¢;|?e(E¢) changes significantly, then our condition will be that A should be much bigger than the width 


of one of the lobes of F, which we can restate as 


Le ere eee 
fi ee 


So this uy is the “transient time” before the finite-rate linear growth is apparent. But there is another term as well — 


the transition probability cannot keep growing linearly with time, and time-dependent perturbation theory only works 
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when the changes of the wavefunction from the initial value are small. So the transition probability cannot be of order 
1, and thus the other condition is ; 
Prrans(to.) K1 => to< Wy: 


In particular, we can only have both of these conditions R Kt< 4 hold if iw < A, and that’s the same as saying 
that the “energy perturbation is weak.” 

We'll now turn our attention to the regime where to gets larger than O(+) (but we still have hw < A). Our time- 
dependent perturbation theory does not work anymore, but with some care, we can still study the system. Defining 
P,(t) = 1 — Prrans(t) to be the probability of remaining in a state /, it turns out that we have the rate equation 


dP, 
dt 


= -wP(t) 


for all t = #, and thus we have P;(t) = e~”*. (And here, we can see that for small times, e~”* ~ 1 — wt gives us 


the expected linear growth, and as t gets larger, the higher-order.) 


Remark 141. The reason we're ignoring the other discrete times here is that their probabilities oscillate in time and 
are thus smaller than the linearly-growing rate of transitioning to the continuum. But the continuum transition is 
dominating here because our energy E; Is already in the continuum and because our V is constant — if we do not have 


an overlap, Fermi’s golden rule will not apply and we will not have a finite transition rate for constant perturbations. 


If we think about t >> +4 


w! 


we see that P(t) > 0, so the system transitions out of the initial state with high 
probability. To figure out where the states actually end up, we should think more about the function F again — the 
window of energies into which we make non-negligible transitions initially shrinks with time, but once the system has 
already decayed from the initial state (for t >> +), it turns out the distribution of final-state probabilities will still be 
basically centered around F;, but the width of the distribution will be roughly fw, and in fact we have the precise 


formula . 
Veil 


(Er, — £; —5&)2 + Pe?’ 


4 


Pr_i(t = 00) = 


and we have a Lorentzian distribution. Because of this, w here is called the line width (along with “decay rate’), and 
we measure the transition rate by looking at the final limiting distribution rather than trying to calculate the decay 


rate! (The 6£ term is a small constant shift due to the perturbation V, but we won't get into the details there.) 


Remark 142. The formula above is the probability of transitioning to a particular state, so if we wanted to sum up 
probabilities over all states, we would need to do an integral that involves p(Es), and we can indeed check that this 


integral is equal to 1. 


Example 143 
We'll now discuss autoionization of atoms as an application of this formalism. Consider a helium atom, which has 


two protons and two electrons, so that the Hamiltonian is the kinetic energy terms plus the Coulomb interactions 


(pe AES es 


2m" 2m |Al lél | (AA 


Pr 


(placing the nucleus fixed at the origin). 


We'll treat the first four terms here as the unperturbed Hamiltonian H®) and the last term as our constant 


perturbation V. Since H) describes two decoupled electrons in “hydrogenic atom” setups, the energy spectra of those 
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two systems are being added together, and we have energy levels 


ee 1.1 
E = 6, + £y = (-13.6 V)Z? (= + =) = —544 e&/ (= a =) 
nL) ne ons 


(since Z = 2 for the helium atom), with ground state E;,; = —108.1 eV, first excited state Fo (or E12), and then 
subsequent excited states are E31, F4i,---, up to E.o,1 (which has energy —54.4 eV. Up until this point, we’ve been 
requiring No = 1, and that’s because the lowest energy level where both atoms are excited is Eo. = —27.2 eV. So the 
point is that because we can have one of the atoms in its continuum states and the other in the ground state and get 
a continuum of energies above —54.4 eV, at this approximation (without any hybridization), the discrete level F>> 
will be embedded in the continuum. 

So we'll start our system in the initial state (2s)? (two electrons in the 2s state), and we'll try to transition to a 
state of the form 1s+free (one electron decays to the ground state, and the other electron is ejected into a scattering 
state). We know that the final energy should be approximately equal to the initial energy for this Fermi’s golden rule 
situation, so 

E, = —54.4 V+ Epes © Ej = —27.2 BV, 


and our ejected electron should have energy 27.2 eV. So in principle, we can calculate the probability of transitioning, 
and in fact the rate is pretty big because the perturbation V is not actually that small (so we're not even fully justified 
in using Fermi’s golden rule)! So we might ask how such a transition might happen in practice, and this is known as 


the Auger effect: 


Fact 144 


If we imagine an atom like magnesium with many electrons, we can imagine shooting in an x-ray and exciting one 


of the core electrons into a higher level. (This is actual ionization, rather than autoionization.) From this point, 


the atom can autoionize, so that one of the electrons goes back to the inner shell and the other (called the Auger 


electron) is ejected. 


We'll now turn our attention to the harmonic perturbations and how they cause transitions into the continuum. 
Recall that we use the perturbation 
6H(t) = 2H! cos(wt) 


(where we should not confuse w with the rate w), which is good at driving transitions from an initial state of energy 


E; into states of energy approximately E; + fw (with + being absorption and — being stimulated emission). 

We'll find that we again get a constant transition rate in those regimes, and the formalism is very similar. We'll 
do the absorption case, where we want the probability of being in the continuum around E¢ = E; + fiw: we know that 
for a general state around this range, we have 
4|H’,,|2 sin? (2¥4-* to) 


Prei(to) = 2 (we; _ wy)? 1 


where Wy; = Een so the probability of transitioning to any continuum state via this absorption can be approximated 


with an integral 
= f dEsp(Er)Preilte) = f dE re(E%) we “A = 
1 


So this looks similar to the constant-perturbation case: we have a term that depends on the specific initial state, and 
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then we have the blue term that looks like the F(w¢;, to) we had before — the main difference is that 


We; ge W 
fi _ Ai , 


so we get the same multi-lobe function but now centered at F; + fiw instead of E;. And the result we end up with is 


similar to before (this is called Fermi’s golden rule, too): 


20 
Prrans (to) =Wt, w= ce (0(Er)|H¢;|7) E;=E;+hw | 


Basically, the only differences are that we look at H’ instead of V, and we evaluate our p and H’ at Er = E; + hw 


instead of E;. And stimulated emission looks very similar, just with E¢ = E; — fw instead. 


Example 145 


We'll now study the ionization of hydrogen: suppose we shine light on a hydrogen atom, so that the electron 


goes from bound to free, flying away with some energy and momentum. We want to calculate the rate of this 


process. 


In particular, we'll assume the light comes in with frequency w high enough that it can ionize the hydrogen electron, 


which has ground state energy —13.6 eV = —Ry (Ry stands for Rydberg and is equal to 2 = $a*mc?). So 


2a0 2mao 
the final energy will be Ef = E; + hw, and the ionization only happens if fw > Ry. (We will think about this light as 
nk? 


an electromagnetic wave, but we can also think about it in terms of incoming photons.) And because Er = om We 


can also find the outgoing momentum fk for our (free) electron. 
We'll make a few approximations to make this calculation possible: first of all, we assume the wavelength of the 
~ Ww 


light A = = = 2™© js much larger than the Bohr radius ap. So the magnitude of the electric field does not vary much 


over the size of the atom, and it is roughly constant. For this approximation to work, we must have 


AX _ 2nhc 2H 1 e? 


a9 fiw ao Qa fiw ao_ 


Since £ is the Rydberg, this can be further written as 


and this is the quantity that we need to be large. That means we basically need fiw < 1700Ry * 23 keV, so the 
frequency cannot be too large. 

We'll also impose the simplification that our electron is nearly free, so that it’s not affected much by the attraction 
to the nucleus. This is basically saying that the energy of the outgoing electron should be much larger than the 
Rydberg (the characteristic energy of the hydrogen atom), meaning that hw — Ry > Ry. Putting this together with 


the above condition, we see that the range we're looking at is 
Ry < fiw < 1700 Ry. 


In particular, the ejected electron will be non-relativistic, so we won't need to worry about relativistic effects. And we 


can reformulate this condition into a condition on the wavevector k for our ejected electron as well: we have 


eke ne ke 
Er = —— =fw—Rys hw = Ry <« —— «1700 Ry, 
2m 2m 
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which further simplifies to 
hi hi k2 2 


h 
a LK IFO Ss 1S (hag S 1700, 
2ma 2m 2ma Aa) 


meaning that 1 < kag < 40. And this approximation kag >> 1 will be one that we will use in our calculation shortly 
— our job next time will be to figure out the (harmonic) perturbation from our incoming shining light, understanding 
how the Hamiltonian of our system is affected by understanding the vector and scalar potential, and then calculate 


matrix elements and the density of states to get our final answer. 


Remark 146. The reason for discussing the wavevector as well as the energy of the is that we may care about ionization 
rates in each direction of k, and because we will be approximating our continuum states as plane waves. So k will 


explicitly enter the final state wavefunctions, and we need that to calculate the matrix element! 


31 April 14, 2021 (Recitation) 


We'll discuss Fermi’s golden rule today, doing calculations a slightly different way, and then derive a few of the results 
we found in lecture. 
Today, we'll consider the probability of transitioning from a discrete to a continuous eigenstate under a constant 


perturbation 6H(t) = V. We found in lecture that we have the transition rate 


1 1 Veil? 
w= me Pi_.¢(to) = = | aE :e(E) A F(wyi, to), 
«2 wtp 
where we convert a sum over states to an integral o(E,¢) over the density of states, and F(w, to) = su is the 


sharply peaked function with lobes that we discussed. This function F has bumps at a regular period = because of 
the sin?, but the heights of the bumps fall off as = because of the denominator — the tallest one has height i 
So if we take large to, we claim that the function Flanto) behaves like 276(w). To understand this, we can collect 


some facts about the function: 


- For any w 4 0, the function goes to 0 — indeed, the off-diagonal peaks of F's oscillations have constant value 


wa and then as we divide by fo, all of the peaks go to 0. 
¢ The width of the central peak = also goes to 0. 


- The central peak has height proportional to to in the limit, in such a way that if we integrate the function over 


the central peak, we get 27. 


So the idea is that the Flute) term in our above expression w can be substituted in, and that explains the alternative 


formula that we found in lecture: we have a delta function of w¢;, which simplifies in to 
|Veil? Er _ E; 
w= [dee +) 12 276 A 


Doing this integral, the fi in the delta function cancels out one of the fis in the other denominator, and we end up with 


QT 
= =, Veil’ o( Er = Ej), 


which is indeed Fermi’s golden rule — the transition rate is linear when we transition to a state in the continuum with 
the same energy. (But still, this approximation only works when tow < 1, so that perturbation theory works, and 


when to is large enough so that 2h is smaller than the variation of the other terms in the integral V2p(E).) 
0 
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We'll now take more time to derive some results that we found from lecture: 


Proposition 147 


When t > i (so perturbation theory is no longer valid), the probability of remaining in state / is P(t) = e~“, 


\Veil 
(fa 2 oe) 4 Lae 


and the probability of transitioning to a final state P;,¢(t) is 


Proof. We'll use a simplified setup: suppose that there is a single discrete state |/) and a set of continuous states |a) 
around it, so that our perturbation V (constant in time) satisfies Vig = (/|V|a) # 0, and also that Vig # Vx, because 


V is Hermitian. 


Fact 148 
Continuous eigenstates are normalized via (a|a’|=) 6(a@ — a’), and we'll also write (a|/) = 0 (orthogonality) and 
| = |i) (J+J da|a) (a| (complete set of states). Writing a generic state as a linear combination of the continuous 


states would then look like 


I(t)) = a(t) |i) + fda C(t) Jax) 


Since we're starting in state |/), our initial conditions are cj(0) = 1 and cg(0) = 0. We had a differential equation 


for the c's before doing perturbation theory, which looked like 
eal) = 2 ae): 
n 


but this was valid only for discrete states, and we need a slightly different version of the formula when we have both 
continuous and discrete states. 


We'll now make the further assumption that V;; = 0 and Vag = 0, and then we end up with the equation 


ihc;(t) = [dace *Viaca(t 


where we have no V,;; term because V;; = 0. And meanwhile, our differential equation for cg looks like 


ie, (t) = Vet) 


where we're assuming that the different continuous states do not connect with each other at all (just to make 
the calculation easier and avoid more coupling in our differential equations). So now with these simplified equations, 
we want to get an equation for c(t), and we'll do that by eliminating c,(t) from the equation. Integrating the second 


boxed equation, we get 
VA j 7 i t 
n= “a | dt'elait c(t’), 
ih Jo 


and plugging this into our first equation, we find 
1 f ; ; 
i dV? de ete Yee), 
0 


since Vai, Vig are complex conjugates so their product is the magnitude squared, and Wj and Wa; are negatives of each 
other. We will now convert the first piece [ da|V;|? into an integral over energy [ dEqp(Eq)|Vail?, and we'll make 
the assumption now that this changes slowly over energy (just like we have previously made), though the eltialt-t) can 
rapidly oscillate with energy unless t = t’. The point is then that the rapid oscillations cancel out any contributions of 


that first o(Eq)|Vai|? part unless t = t’, and thus we can approximate c(t) = c(t’). 
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So now we can just pull that factor out of the integral to get 
1 ee ; 
a(t) = alt) — a5 f dEap(Ea)Iail? [detente 
0 


and now in the large-t limit the integral here looks approximately like the dt" elviat” (where t” = t—t’). If we integrated 


this expression over all t”, we’d get 27 times a delta function, so it turns out that what we have here is 
on - At 
| dtela® x 15 (Wig) + iP(Wia), 
0 


giving us an extra imaginary part from the principal value in complex analysis. This can be rewritten in terms of 
energies as 
= 7hd(E; — Ey) +-:: 


and thus if we plug this in and do the integral, we will find that (delta function picking out only energy E;) 


T we of 
— 5 P(E;) Vail” t= zy = Roki 


C(t) _ 
c(t) 


for some shift to the energy 6£; caused by the imaginary principal value term above. And the right-hand side is 


constant in time, so we can solve for cj(t) with a direct integration, and indeed this gives us 


Ww 
ee 


i 
a(t) =e (—Ze- F560), 
and the probability |c;(t)|? of remaining in state j is indeed e~”* as desired. And if we substitute in the form of c; into 


Voxi : J iW; t! ! 
Co(t) = ; die ett), 


we find that the right-hand side is an exponential integral, and the result turns out to be 


Vivi al-o +7 Fa Ete) )t 


ih (-“ + 4(E, — Ej) - 6E;)) 


In particular, in the large t limit (which is what we should be taking), the first term in the numerator decays away to 


0 and thus we just have 
Vovi 


* TE, — E; — 66; + Be) 


So the probability that we transition from state / to state a will be 


[Veal 
(Eq — E; — 6E;)2 + Et’ 


rg 


Prsa(t) = lar = 


as desired. (And because we have an integral over as, this is a probability density rather than an exact probability. ) 


So far, we've always been doing first-order perturbation theory, and we'll take a quick minute to discuss how we 


might derive second-order perturbation theory. The idea is to look again at the ¢,,(t) differential equation 


Meat) = > alee Vin 


n 


and substitute in c?)(t) for ém and c\ for Cn(t) (since we're basically doing a perturbation in V). So the strategy 


would be to take : y 
PDs a, dpe ete) 
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and substitute in the expression we know for c(t); we then end up with a double integral solution 


fi At Vj 
_ dt’ dt’e IWmnt Vinn elni t” Yni 
=D ik is ih: ih 


which then simplifies by doing one of the integrals to 


t [Wit 
rf dt! emt! = —1 
0 5 (Ea Ey) 


So in this second-order calculation we see again the ey term that we saw in time-independent perturbation theory! 


This can be thought of as a “virtual transition,” in the sense that V,,; takes us from / to a “virtual state” n, and then 
Vinn takes us from that state back to the final state m that we're looking at, and we might see this kind of expression 


in quantum field theory as well. 


32 April 15, 2021 


Last time, we discussed Fermi’s golden rule for harmonic perturbations, and we started a classic application of our 
formalism — ionization of hydrogen. Recall that the setup is that we have a hydrogen atom initially in its ground state, 
and then we shine light at it with frequency w. It's possible that light will ionize the atom, kicking out the electron, 
and then the final energy of the electron will be E¢ = fiw — Ry, which we can also write in terms of the wavevector as 


hk? 
2m * 


We make the approximation that the wavelength of light is much larger than than the Bohr radius ao, so that the 


incoming light wave doesn't vary much over the atom, and we also make the assumption that the Coulomb interaction 
between the nucleus and ejected electron does not affect the ejected electron. And we discussed last time that these 


two approximations together give us the range of allowed energies 
13.60 < fw < 23 kV = 1< ka < 40. 


Our next step is to understand how the hydrogen atom couples to our perturbation due to the light. Since light is an 


electromagnetic wave, we can (classically) write the E and B fields as 
E(F,t) =2Epsin(¢-F—wt), B(r,t) = 24x Egsin(¢- 7—wt) 


20 w 
Be 


where q is the vector of propagation with magnitude q = = = But electric and magnetic fields enter the 


Hamiltonian through the vector and scalar potentials 
10A = 


F=-Vo--—, B= 
™ c Ot’ 


<u 
x 


so we need to find a ® and A that work — one solution is 


2¢ = 
o=0,A(7,t) = -= Ep cos(q- F— wt). 


In particular, the divergence V - Ais zero, because it is proportional to q- Eo by the chain rule. (What we have here 


is the Coulomb gauge.) So our Hamiltonian now looks like 


A\2 
pS (54 =A) eb — © 
2m a 
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(remembering that the charge of an electron is —e), and plugging in the form of our solution gives us (after expanding 


the square and getting equal cross terms because the divergence of Ais zero) 


2 
ee ee 
2m mc 2mc? r 


2 


We'll work perturbatively in the magnitude |Eo| of the electric field — the third term proportional to A? is then quadratic 
in Eo, so to the order we're working in we can ignore it. This means we have (substituting in the expression for A) 
m2 


2 
2e = 
= =) mano Pood: F— wt). 


e 


H= HO +54 = ( 


We now place our proton at the origin, and remembering that we made the approximation that the wavelength of light 
is much bigger than the Bohr radius, that means that 7-7 7-0 =0. So we'll ignore the spatial dependence of the 


cosine, and our perturbation looks like (remembering cos(—wt) = cos(wt)) 
2e€ = _ : 
6H = ———E - Pcos(wt) = 2H’ coswt, 
mw) 


where H’ = —-<-E - p is our time-independent Hamiltonian. 


Fact 149 


If we go back to our electric field and make the approximation at the beginning, meaning E(F, i) = DEy sin(qd: F— 


wt) © —2Epsin(wt), we can then use a different gauge and choose 


= 


=2E£)-Fsinwt, A=O0. 


We then find that the perturbation ends up being 
6H = —e® = —2eFy -Fsinwt =2A’sinwt, A! = —eEy-F. 


So this time we have a sine instead of a cosine, but it doesn’t matter in the derivation of Fermi’s golden rule (since we 
write oscillations as a sum of exponentials, and also because the phase shift doesn’t make a big difference over long 
periods of time). That means this is also a valid way to proceed. 

This gives us B = 0, but that turns out to not be a problem — the effect of the magnetic field on the electron 
is much smaller than the effect of the electric field for an electromagnetic wave like this, because the Lorentz force 
is F = —eE — ez x B, and in the units that we used the magnitude of electric and magnetic fields are equal. (So the 


non-relativistic nature of the electron makes the effect of B suppressed. ) 


Proposition 150 


The perturbations H’ and A’ lead to the same transition (that is, ionization) rate. 


Proof. Recall that for harmonic perturbations, the rate of upward transitions takes the form 


20 ; 
w= = (FHL) o(Er) 


E;=E\+hw 
So what we need to show is that the matrix elements |H,|? and |H';|? are equal, which means we need to compare 
(f\pli) to (flr). We can understand this by remembering that |f) and |/) are taken to be eigenstates of the 


unperturbed Hamiltonian 


where we have the (verifiable by calculation) commutation relation 
ih 

HO) 7] ==—p. 

[a] SS — eB 


So now we can get our answer by noting that 


ean = (4 aj) = (ear rH), 
and now we can apply H©) on the bra and ket to find 
= (Er — Ej) (F Ir), 

and thus we've found a relation between those matrix elements! And now we can just plug these in: we have 

a e€ = a le - ; 

(F|H |i) = Fy £9" (Fpl) = — 7% (EF = Entei lr). 
and writing this in terms of the matrix element for A’ gives us 
Ef 


je p= Ej Ap): 
= FE (ran) 


And because we evaluate this expression at EF, = E; + fw< the prefactor is just /, and thus the squared magnitudes we 


get from H’ and A’ are indeed equal. So our Fermi’s golden rule answers are the same in both cases, as desired. 


We'll often use the A’ gauge in the future when doing computations — it’s a bit weird because we're setting B = 0, 
though. (In both cases, this sequence of approximations is called the electric dipole approximation. ) 
So now we need to actually find the transition rate w: to do that, we will calculate H/, and also p(E;). The form 


of the wavefunction for the initial and final state are 


li) = Wro0(F) = 


(the ground state of the hydrogen atom), and 


(approximating the scattering states as free electrons, which are in plane waves). Remember that we’re using a cubic 


box of size L? so that we can calculate the density of states as well, and the energy of our final state is re (in the 


approximation where Ef >> Ry). So then the matrix element we want is 


Hp, = (FAL) = (bg 


Cs et es, 
eee =i B|¥r00 ) ’ 
mw 
and now because the plane wave is actually an eigenstate for p, we can apply p on the bra to get 


hE, 
EIR jet asl 


and then the remaining overlap can be computed with the integral 


(bg| ¥100|=) / Pr ik Fant /a0. 
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We can evaluate this by going to polar coordinates, and it turns out the answer is (the dimensionless) 


2 
pee” ee (Rag) 


Remembering that kao >> 1 in our “free electron” approximation, we can neglect the 1 in the denominator and rewrite 


this as 
- 8/nr 1 
™~ "13/2 5/2° 
L3/ k4 ap! 
Substituting this back in, we find that because hw + fiw — Ry = ae we know that the mw can be replaced by ug 
in our calculation for H;;. So 
Eo-k8/n 1 


So the rate of transition here depends on the outgoing electron’s wavevector — in particular, it depends both on the 
angle (because of the Eg - k term) and on the magnitude (because of the zz). If we write Eo - k = Egk cos@, we can 
collect terms here to find that the matrix element is 

H,, = —16.\/1(eLp ag) cos 9. 

fi V1(eE ao) Kail? 3/2 

Now eF gap has dimensions of energy, and the rest is dimensionless — if we square this expression, we get one term in 
the transition rate w, and the other term we need to find is the density of states p(E;). But we calculated that for 
this kind of setting already — the density of states per solid angle for a free particle in a cubic box satisfies 


(jal = *™ ea0 
PEA \ on) Fe 


So we can now take everything together and plug it into Fermi’s golden rule: the rate into a solid angle dQ looks like 


on 27 1 L\*m 
dw = ——|Hpi|*o(Er) = = (256m(cF 20) co Orar7s J . (=) ae 


and now (as expected) the L factors cancel out, and collecting all other terms gives us 


a 64 (eEgao)* 1 
~ ahh /(mag) (kao) 


cos? 6dQ. 


So we've now found the partial ionization rate 


dw 64 (eEoa)* 1 
dQ wh h?/(mas) (kao)? 


cos? 6, 


and in particular the cos? @ term tells us that the strongest ionization occurs parallel or antiparallel to Ey. The units do 

indeed work out (the denominator ae is twice the Rydberg), and we can notice that as we increase the momentum 
0 

of the final particle, there is a steep dropoff for the ionization rate. And if we only care about how many electrons are 


ejected per unit time, we have 


and since { dQcos?@ = f sin@d@ [ dpcos? 6 = *Z, our full ionization rate is 


= 256 (eEgaq)* 1 
3h? /(map) (kao)® | 
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(We should remember that we've defined Eo so that the electric field amplitude is 2Eo, so we should be careful if we're 


consulting other references in the literature.) 


Fact 151 


If we do a similar calculation with He: instead, using the same plane wave for our final state, it turns out that 


we actually get a difference by a factor of 4. The reason that we get this discrepancy is that the final state is 
an approximation and not an exact eigenstate of H), and it turns out that the correct answer comes up from 
H' but not from A’! So we need to include corrections to the scattering states due to Coulomb interaction if we 


want to use perturbations with 7 instead of p. 


We're now going to switch topics and discuss interactions of light with atoms that cause transitions between 


discrete levels, rather than from the discrete to the continuum. 


Example 152 


Suppose that we have an ensemble of atoms that are interacting with light, and there are two particular bound 
Eye 


states |a) and |b) with energies E, < Ep. We define wy, = Eo Ea > 0, and we shine light on the atoms with 


frequency Wp. 


As previously discussed, there are two processes that occur; we'll describe them using photons even though we'll 
be doing calculations with waves. In absorption, we shine a photon of frequency w & Wp, which causes an upward 
transition for an atom from |a) to |b) — the atom then absorbs the photon. Meanwhile, in stimulated emission, we 
shine in a photon of frequency w, and that stimulates the atom to transition down from |b) to |a), and then two 


photons are emitted (for conservation of energy). 


Remark 153. Lasers (“light amplification by stimulated emission of radiation”) work in this way. Basically, we have 
an ensemble of atoms that are initially prepared in the excited state |b), and we send in a single photon. Once that 
photon causes an atom to stimulate a downward transition, it releases more photons, and this causes a chain reaction 
of more and more photons. So the trick is to get the atoms initially in |b), and this is called population inversion — 


usually we do this with a third energy level. 


From here, we'll think about our ensemble of atoms as living in a furnace of some fixed temperature T — there is 
radiation inside the furnace, and we'll treat it as blackbody radiation. The idea is that after some interactions between 
light and atoms, we will achieve equilibrium, and we will see that spontaneous emission will also occur. But we'll 


discuss this in detail next time! 


33 April 21, 2021 (Recitation) 


Today's recitation is being taught by Cagin Yunus (the TA for this class). We'll cover three main topics today — we'll 
review the ionization of hydrogen, do a related problem, and understand the effect of electron spin on the transition 
rates between states. 

Recall that the setup for ionization of hydrogen is that we have a hydrogen atom, in which we make two simplifi- 
cations: first, we assume that the wavelength of the incoming light A is much larger than the Bohr radius ap, so that 
the magnitude of the electromagnetic wave does not vary considerably on the scale of the atom (and thus we don’t 
need to worry about the spatial dependence of the EM wave). This is the condition that fw < 23 keV = 1722 Ry, 
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or equivalently that kya >> 1 (where ky is the wavenumber for the incoming light). But also, we assume that the 
ejected electron has high enough energy so that it is in a free-particle state, which implies that fu — Ry > Ry; we 
often rephrase this as hw >> Ry or that ke >> 1 (where ke is the wavenumber for the outgoing electron). And thus 
we have the allowed range 

Ry < fw < 1722 Ry, 


where we can say things like “1 < 10” for purposes of our approximation. 
Our interaction picture Hamiltonian can then be obtained by taking the gauge where 

= Cus an 

A(F, t) = ~ 7 2Fo cos(ky-7F—wt), (Ff, t)=0, 
so that the perturbation is given by 

e = = 
6H = ——2Ep- poos(ky -F— wt). 
mw 

We now use the first assumption to say that the argument inside the cosine is essentially cos(—wt), since the first 


term ie - F is very small. So now our Hamiltonian takes the form 6H = 2H’ cos(wt), and in this case we have 


e = 
ey ee 
mJ 


(Recall that the reason for the factor of 2 is that cos(wt) = 5(elve + et), and each exponential corresponds to 


a mode of emission or absorption.) And now we can use Fermi's golden rule, calculating the matrix element: if we 


assume that our electric field is E = E92, so that H’ = aL then we want to find the matrix element between the 
mw 
states i f 
Ae _ —r/ao _ 2 Wee iKe-F 
l/) l100) Jra ’ If) — \vz) = [3 , 


where L is the “large box” that we put our free box in and ke is the outgoing wavenumber for the electron. The matrix 
element then looks like _ 
: e€Lo . 
He; = (F|H'|/) = ates (f|pz|i), 


and now because pz is Hermitian, we can essentially apply it on the bra: 
eF : 
= —— fk. cos 6 (f Ii) , 
mw 


where @ is the angle between ke and E. We can then compute this matrix element by integration and use our other 


simplification: 


8rae 
(fli) = ——2__. & 8a, 
(1 + k2a3)? 2 
so that the matrix element is ‘ 2K 
cos 
Ve= "16 E, , fw= ——. 
fi /1(eEoao) [3/2ai™ 5 W = 


Now because the density of states per solid angle is p(E.) = Hem kedQ, we find that 


dw _ 
dQ 


as 64 mas (€9a8) cos? 6 
— p(Ee)|H¢i|? = oe 
hi p( e)| Fil n he A (keag)?' 


and the integrating through tells us that the transition rate 


3 8 koa 


_ 256 m e? EG 
We'll now solve a related problem: 
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Example 154 


Suppose we're in the ground state of the hydrogen atom Hamiltonian, and we're subjected to an external potential 


of V(X, t) = Vo cos(kz — wrt). 


We want to know the transition rate for the electron to be emitted with the momentum /, and we want to compare 


this answer to the photoelectric effect calculation we did above. First, we will expand our potential as 
o Vor lies ioe) —i(kz—wt) 
V(X, t) = 3 [e’ Z—W + e WKZ—W : 


where the two terms correspond to absorption and emission (though there is no longer any light to absorb or emit, 
we can still think about our system as gaining or losing energy). We'll focus on the first term of absorption, and then 


the transition rate will be given (again by Fermi’s golden rule) by 
2 
Wei = = (Er )|Veil?. 


We need to calculate the matrix element here, and much like before, we have to integrate the wavefunctions across 


space: 


Ve; = Vo 1 dexteTiBx/f gr! /a0 eikz! 
2 \/mae 
0 
where e’*2' is the term from the potential. If we make a change of variables here so that 7 = 2 _ «3, then our integral 
g i g 


(without the leading constants) looks like 
/ Aexiel® eo /40 — i: sin d0dg(r')? dr’ ef” $88 e— 1/20 


3 
4nay 


Tangy? If we plug this back in with our density 


by switching to spherical coordinates, and we can evaluate this to get 


of states p(Er) = Bk dQ, where kp = g we find that 


dw mVZab ke 


dQ wh? (1+ a§q?)4 | 


We can notice three main differences in this answer compared to the one above: 


- There is no cos? @ factor, because there is no polarized electromagnetic wave. 


¢ We do not make the approximation that agq < 1, so we can't get rid of the denominator here. This is because 
there is no “free electron assumption” that we can make this time. So in this problem, we can consider ejected 


electrons with small momenta, when we could not do so for the photoelectric effect. 


« In this case, we use g = ke — kZ, where kp — B But in the photoelectric effect above, we essentially ignored 


the second term kZ because we had the corresponding assumption ky - a9 < 1. 


Example 155 


We'll now move to discussion of spin contribution, thinking about coupling between spin and magnetic field and 


explaining why we can neglect it in the photoelectric effect calculation. 


Our perturbation due to spin looks like 


sa jhsa— 638 Ses. 
mc 2mc 


h 
a" a-(Vx A), 
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and now we can use the identity that V x (f¢) = Vf x ¢ (when C is a constant vector and f is a function) and plug 


in Our previous vector potential A with €= E> to get that 


B=VxA= <2sin( key -F—wt)ky x Eo. 


Here, we'll assume that E = Fo2Z Is pointed in the z-direction, and ky = k,y is pointed in the y-direction. Then we 
have 


ae oe “ 
B= en oky sin(ky - 7 — wt), 
so that our Hamiltonian changes by a term 


eh ee 
6H + OH+ mu HOM sin(ky -7—wrt)ox, 


O 1]. , : ets ; < 
where oy, = f ; is the Pauli matrix. If we assume for example that our initial spin of the electron is in the 


z-direction (so that it’s perpendicular to B), then we can calculate OH¢+,i,, because that will let us know whether the 
effect of coupling is strong enough to change the spin. (The original 6H term does not contribute to this matrix 


element because the overlap of t with | is zero.) To do this, we expand the sine in terms of exponentials again as 


sin(ky -7—wt) = > (ei VS eee aD) 


with two terms corresponding to absorption and emission, then finding the matrix element requires us to compute the 


1 e?h? 647 
Ly 2 2-2 
|6 rid ~ 4 mew we ky 7 (cee) , 


where we use the approximation ke - a9 >> 1. We then get the transition rate, and it turns out that the ratio between 


same integral, and we find that 


transition rates for a flipped spin versus no flipped spin is 


2 
wit 36 oy 
Weil 4 k2 / 


where the last inequality comes from ky - a9 < 1< ke- ap. So the spin contribution can indeed be neglected! 


34 April 22, 2021 


We've been discussing the interaction between light and atoms, and we'll continue that discussion today. In particular, 


we'll study an argument of Einstein's about how light and atoms can be at equilibrium at some temperature. 


Example 156 
Consider a cavity at some temperature T. There is electromagnetic radiation inside the cavity, which we will 


model as blackbody radiation. This cavity also contains atoms in equilibrium with it, and each of those atoms has 


two levels |a) and |b) with energies E, and Ep (we'll say there are Nz of them in |a) and Np of them in |b)). 


We'll work with the following three facts: 


¢ At equilibrium, Nz and Np, do not change in time, meaning that N, = Np = 0. 


Poe ‘ —BE, = * . . . 
« At equilibrium, we have Me = Cor = e 6M». _ that is, we obey the Boltzmann distribution. Here, 6 = ar 
a 
where kg is Boltzmann’s constant, and wy3 = Enos 
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- At temperature T, we see blackbody radiation satisfying Planck’s blackbody law: 


Radiation energy per unit volume of frequency € [w,w + dw] = U(w)dw = Pa id 

We need to understand how this thermal equilibrium is maintained, and to do so, we need to understand what 
processes go on inside the cavity. We've already mentioned two of them: we have absorption, in which an atom starts 
at level |a), a photon comes in and is absorbed, and the atom ends at level |b). We also have stimulated emission, 
in which an atom starts at |b), a photon stimulates a downward transition to |a), and two photons come out. 

We want to know the rate at which each of these processes occurs: for absorption, the rate should be proportional 
to the number of photons, which is proportional to U(w»,), and also proportional to the number of atoms N, as well 
as some temperature-independent quantity Bay (which is inherent to the atom and tells us how likely the stimulation 
is to occur). In other words, the rate of absorption looks like Baz,U(Wy3)Nz. And similarly, for stimulated emission, 
we expect that the rate for the process is proportional to U(wp,) (number of photons of the correct frequency), Boa 
(intrinsic to the atom), and Nz, (number of atoms), so we'll write it as ByzU(wpa) Np. 

But these two processes are not enough to maintain equilibrium, and let’s see that now. At equilibrium, we need 


the rate of change of Nz and Ny, to be zero, meaning that the total rate satisfies 
0 = Np = BabU(Wpa) Na — BoaU (wba) Np, 


which simplifies to 


N 
N,U(Wpa) G ina Bont) =0, 
a 


so that (using our Boltzmann statistics assumption) 
Bap — Bpae PMs = 0. 


But this doesn’t make sense, because Bz, and By, are temperature-independent, and the exponential does depend 
on temperature! So this cannot be true for all temperatures T, meaning that we cannot maintain the system at 
equilibrium. 

What solves this issue is adding a new process called spontaneous emission: an atom starts at |b) and drops to 
|a), emitting a photon, without external stimulation. This rate is then not dependent on the radiation in the cavity, 
so the rate will be AN, (proportional to some intrinsic property of the atom and the number of atoms). Here, A does 
depend on which levels |a) and |b) we're in, but we're just considering two states in the atom for now. 


So now we can write down an equation for the total rate of change: 
0 = Np = —ANp + BapU(Wpa)Na — BoaU (Wa) No. 


Rearranging this, we find that 


N A iL: A 1 
A= BapU (Wa) a = BpaU (Wa) = U(Wpa) = Bap Na = Bea = Bap cBtwna — Boa 
b ab ab 


But we also know the formula for U(w) from our starting assumptions, and we can now compare the expression we 


got with 
3 
W 
ey Tce 1 
A fh 3 : . ! 
We then learn that | By, = Bap | and also that ae aaa ba . In particular, notice that at high temperatures, there 
ab 
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are many photons around, and thus stimulated emission will dominate. Specifically, if kg7T is larger than the relevant 
energy scale fw pz, then stimulated emission is the primary process, but otherwise if kgT < fiwp,, then spontaneous 


emission will dominate instead. 


Fact 157 
Historically, Einstein knew about the existence of spontaneous emission and absorption when he made the original 


argument, and he used this calculation to show that stimulated emission must also occur! 


Example 158 


We'll now compute By, explicitly and use it to find the rate A of spontaneous emission. 


Remark 159. Computing A explicitly on its own requires us to quantize the electromagnetic field, but we've been 
treating electromagnetism classically in this class. (We should think of spontaneous emission as being driven by 


quantum fluctuations of the electromagnetic field, which are present even in vacuum.) 


So now let's work on our problem: we have atoms interacting with thermal radiation at temperature T, and we want 
to calculate the stimulated emission rate per atom B,,. We know that the electric field is the primary contribution to 
the interaction of light and atoms (because the interaction with the magnetic field is suppressed by a factor of 4 « a 
compared to the electric field). We'll also assume that we're working with optical transitions  € [400 nm, 800 nm], 
in which the wavelength of light is much larger than ao (so it is much wider than the atomic cloud). That means we 


can write the effect of the electromagnetic wave for a particular mode as 
E(Fr,t) = E(t) - # = 2E 9 cos(wt)A, 


where 77 is the (unit-length) vector of polarization. This corresponds to a vector and scalar potential of 


o(7,t)=—7-E(t), A=0, 
so that V = E and our perturbation is 
6H = qO(F, t) = —qr- E(t) = —qF- FE(t), 


where g = —e Is the charge of the electron. We can then define the dipole operator 


> 


d=q-f, 


which points from the negative charge (the electron) to the positive charge (the nucleus of the atom). And thus we 
can rewrite 
6H = —d- E(t) = 2(—d- AE) cos(wt). 


(this is indeed the energy of a dipole in an electric field). This is a harmonic perturbation, which we write as 2H’ cos(wt) 


for a time-independent Hamiltonian | H’ = —d’- WE |. 


So now we calculate the rate of stimulated emission due to a particular mode of light by calculating the probability 


P2_p(t) associated to it: by Fermi’s golden rule, the transition occurs with probability 


AlH!.|2 sin? (Yea ¢ 
Px—p(t) = ol a 


(Wea — w)? © 
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The matrix element in this expression is 
Hl, = —Ep (ald Alb) Sie Rs (aldjb) SS Edy. 


where 
day = ((ald*|b) , (ald”|b) , (d?|b|)) = a((alx|b) , (aly|b) , (alz|b)). 


So if we plug things back in, the probability of making a downward transition due to this particular mode after time t 
iS 3 | 
=  _4sine (S2—*¢ 
P3_p(t) = = dab ; Ail?——2 
h (Wha _ w) 


It's convenient to rewrite this formula in terms of the energy carried by the electric field per unit volume, which is 
|E|?  4E2 cos?(wt) 


= = 1 22 
er ae = 5q Fo cos‘ (wt). 


2 
We have a rapidly oscillating cosine term, so on average (over time) this energy per unit volume is Ue = a But 
electromagnetic waves also have a magnetic field, and recall that the energy in the electric and magnetic field are the 
same! So Ug = Ue, and thus the total energy is u = a EG. (Even though the atom mostly interacts with the electric 


fields, the energy is still equally distributed between the two fields.) Therefore, our probability of transition is 


4 Sn gsine {Meee 
Px-p(t) = 45 (2mu(w))|dap - Al” Gores, 


(Wha _ w)? 
for a single mode of frequency w and polarization 7. And because our atom interacts with an ensemble of modes of 
electromagnetic radiation — we'll basically see a continuum of light modes inside the cavity if it is very large — each 
mode w; and corresponding polarization 7; will have an amplitude Eo(w;), and the different modes are uncorrelated. 


So we need F 
Wha 
5sin (Seas t) 


PLEB(t) = DO ge Qmulen)) dan IPT 


and we now need to perform the sum over all modes, where we're summing over frequencies and averaging over 


polarizations (to fit with the normalization of u(w) above). For each frequency w, we therefore need to find the 


average ( 


a 2 
dab - il ) by doing so over the unit vector ;, and we can write this out by components as 


zh S> bya [\- (ra yt D(a) = dA bp)’ day (ri n*). 


J=XY,Z 


But when j # k, by reflection symmetry the average is 0, and for the others we know that 


(they must all be the same and they add to 1 because f7 is a unit vector). Thus, 
fay Fl = tei)" OK = Fld! 
da ab ab’ 3 3 a , 


so that if we plug this back into the total probability of transition, we have 


(Wha — Wi)? © 


PHO) = 3 3 pe row) 
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The sum over / is now a sum over frequencies: 


 8r|dapl? sin ie “i t) 
~— - 3R2 d u(w Rear a —wy)2 ' 


and now we can approximate this as an integral 5>; u(w;) + f dwU(w) (since U(w) is the radiation energy density) 


d, 2 en Wha W 
— Baldasl” [out Zan 


3h? (Wpa — w)? 


We'll use the usual strategy here: because the function in the fraction is sharply peaked around Ww = Wp, we can 


to get 


approximate this integral for large enough t as 


81 dap? sin? (22—" t 4n?|dap|? 
= a U(w ww) [ dw a = aHE U(wpa)t 


(we've done the integral before and it evaluates to my). So even though each individual mode contributes to our 
probability in an oscillatory way, again the sum over modes gives us a positive transition rate, much like Fermi’s golden 
rule (even thuogh we're dealing with two discrete states here)! So this gives us a stimulated emission rate 

An? | dap|? 


3 U(Wba) = BoaU (wpa), 


Warb = 


and now we have the expression we're looking for: 


2 => 
dap? 


T 
= ap! 


(Indeed, we notice that By, = Bay.) And Einstein’s relation gives us the final answer we're looking for for spontaneous 


emission: 


3 
= Alba? i 4 wy, 
W203 °° 3 ic? 


| aol? | 


At the end of the day, we’ve found that these transition rates only depend on the matrix element dei. 


35 April 26, 2021 (Recitation) 


We've been talking about Einstein's A and B coefficients and making dipole transitions in the atom (absorption or 
spontaneous/stimulated emission). Today, we'll do some review of this, and then we'll derive some selection rules to 
understand which transitions are actually allowed by the dipole Hamiltonian. 

Recall that those three processes above can be described in terms of light and atoms: there are two states |a) and 


|b) separated by an energy fiwpa. 


« Absorption occurs when an incoming photon with frequency approximately wp, causes the atom to move from 
|a) to |b), and the transition rate here is N,;Bz,U(Wpa), where N, is the number of atoms at a, U(w»,) is the 


density of photons at this frequency, and Bz, is some atom-dependent quantity. 


+ Stimulated emission occurs when an incoming photon causes an atom to move from |b) to |a), and then 
(because of energy conservation) two photons are then emitted outward as radiation. This rate is very similarly 
NpBpaU (Wa). 


+ Spontaneous emission occurs without a photon: an atom goes from |b) to |a) and emits one photon. This 


124 


rate is independent of the presence of photons, and it is NpA. 


In lecture, we then went through Einstein’s argument using statistical equilibrium and balancing the combined rate 
of transitions. We need N, = Ny = 0, and we know that Nz and Nz, have some ratio at equilibrium, given by the 


Boltzmann distribution e~®"™ (where B = rae We also have the blackbody radiation formula 


W2 


By ace aoe 


Plugging everything in, we ended up with the result 
Bap = Boa, A=B 


and we can find more explicit solutions by looking at dipole transitions: if we think about an electric field E(t) = 


2Egfcos(wt) for a given field being applied (where we use the usual approximation to drop the spatial dependence), 


we can use the usual Fermi’s golden rule on 65H = —d- fi(2Ep) cos(wt) for the dipole moment operator d = g?. We 
then have H’ = —d- AE, and we can write this in terms of the energy density u = Fi (which depends on w because 


we generally have a spectrum of different frequencies in our electric field, with different magnitudes). Reading off 
coefficients then allowed us to find Bay, Bya, and A: 


2 3 
An = _ 4 Wy, 


ape lal’, A= 3 ca | ol 


where we have the dipole matrix element dap = (ald]b) = q(alr|b). 
We'll now talk more about these matrix elements in detail: we'll work out a lot of them in our problem set, 
but there's a few general rules that will help us out. We want to think about the hydrogen atom, and we want to 


understand when we can go from |n’, £’, m’) to |n, 2, m). 


Problem 160 


When are the matrix elements 


(n, £, m|F|n’, 2, m’) = f Prem APOn, 0.00) 


nonzero, meaning that the corresponding transition is allowed? 


Our first consideration is parity: if we look at the change of variables on the right-hand side replacing 7 to —r, 
then the 7 gets flipped under parity, while the d?r stays the same. Since each Wy¢m state is a definite parity state 


in hydrogen, this means the product w (F)Wne.m(r) must be odd (or else the change of variables gives us the 


* 
né,m 
negative of the original integral, meaning it was zero), which is the same as saying that the states must have opposite 
parity. 


And because parity is (—1)* (the orbital part is what matters here, since the radial part only depends on magnitude), 


what this means is that 2 and & must be of opposite parity, or in other words that |£+ 2’ is odd|. This is an example 


of a selection rule: it for example forbids us from making a transition 2 = 0 to 2= 0. 


But it turns out that we can be more specific: it is required that we instead have | Ag = £— £’=+1). To show 


this, we should recall our earlier work with vector operators: we showed 


1 
(21h)? 


= “ . 1 = 
[FP [FV =(V- AF (Pv +VJ) 
for a vector operator V. We'll apply this for J = L (the operator which causes rotations) and V = fF, and we can 
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notice that 


Our result then ends up being 
[Cie naar (rer), 


and we basically want to take matrix elements of this formula to get useful results. We can write out the commutator 
on the left-hand side: we find that 


(L?)?r — 20? rk? + r(L?)? = 2n?(L?r + 7L?). 


If we put a (n, 2, m| on the left of both sides and a |n’, 2’, m’) on the right of both sides, the L2 operators can be 
applied against the bras and kets to get numbers (because we have eigenstates of L?, specifically with eigenvalues 


h£(2+1)). So the left-hand side simplifies to (noticing that we have a perfect square) 
ft (&(@ +1) —2( +1))? (n,2, mA’, £m), 

and similarly, the right-hand side simplifies to 
= 2n* [ee +1) 4+ 2 (e +1)] (n, 2, mri’, £m’). 


Simplifying, we find that the matrix element factors, powers of fi cancel out, and what we have left is 


(n, 2, m|7|n’, £, m') (4+ 2 +1)? — 1) (@- 2)? — 1) = 0}. 


So either the matrix element is zero, or 2 = £/ = 0 from the second term, or | — @’| = 1. But we already showed by 
the previous selection rule that 2 = 2’ = 0 doesn’t work, so indeed we recover the result promised: transitions can only 
occur when | Ag = +11]. 


Remark 161. The simplest way to think about this result physically is in terms of total angular momentum conservation: 
making the dipole approximation to the photon means the only angular momentum that’s going on here Is in the photon 


and the original state, and photons have spin 1. 


But we can also say more about the other quantum number m and try to understand Am = m— 1m’. We'll do that 


one component at a time. 


+ For the z-component, notice that [Lz,z] = L,z — zL,0, so if we take matrix elements of this expression 


(remembering that |n, 2, m) is an eigenstate of L, with eigenvalue fim), we find that 
(m—m)fA(n, 2, mz|n’, 2, m') = 0, 


so the only way to get a nonzero z-component of the matrix element is if Am = 0. 


+ For the other components, we can notice that 
[Lz,x] =ihy, [Lz, y] = —/fx 
(essentially, a rotation in z turns x into y), and taking matrix elements on both sides will give us 
(m— m)A(n, 2, m|x|n', 2, m’) = ifn, 2, mly|n', £m’), 


(m—m')h(n, 2, mly|n’, 2, m’) = —if(n, 2, m|x|n!, 2, mm’). 
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We can then solve for the two unknowns (matrix element of x and y), and we find that 
((m— m’)? — 1) (n, 2, mly|n’, 2, m’) = 0, 


and also an identical formula for the x-matrix element. So that tells us that the only way to get nonzero x- and 
1. 


y-components are if Am = 4 


So putting everything together, the only nonzero matrix elements for F (and thus the only nonzero dipole matrix 


elements) is if A€ = +1 and Am = 0,+1. But the levels in n can jump in arbitrary ways, since those transitions depend 


on energy differences and those are dependent on the energies of the incoming photons. 


36 = ©April 27, 2021 


Last lecture, we discussed Einstein's argument for the interaction of light and atoms — we discovered that in addition 

to the absorption and stimulated emission occurring when we treat light like a classical wave, we have an additional 

process, spontaneous emission, that also occurs without any external stimulation from a light source. We deduced 

with Einstein's argument that the rate of these spontaneous emissions is 

_ 4b, 
3 hic 


A laaals 
where dap is the matrix element (b|qrja) of the dipole moment q7 between our initial and final states. Since this is 
a decay rate, if there is no external radiation (light) source, then the number of atoms N,(t) in state b satisfies the 


rate equation 
dNp 


dt 
meaning that T = 5 is the lifetime of our excited state |b) if it can decay to |a). More generally, we may have a bunch 


=—AN, — N,(t) = Np(0)e*, 


of different decay rates A; because |b) can decay to a variety of other (lower-energy) states, and thus we can write 


down a total rate Arora) = Ar +--- + An. The lifetime involved then satisfies 


(If there are multi-step decays from |b) to a lower-energy state, all that matters is the initial decay rate.) 
We've mentioned that spontaneous emission is accompanied by a photon, and indeed we can look at the frequency 


of the outgoing photons in such a process. 


Fact 162 
Much like in Fermi’s golden rule, the frequency of the emitted photons is strongly peaked around Wy, with width 
A: specifically, the distribution function is the Lorentzian 


A 1 


2m (w sa Wab)? oF a 


P(w) = 


In fact, calculating this width is the usual experimental method for extracting the decay rate. 


(This is the reason we often call A the line width.) For an atom like hydrogen, the electric dipole transition rate 
(recalling that our formula for A above assumes that the electric field contributes much more to the rate than the 
magnetic field) goes as 


hA ~ abmec?. 
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Comparing this to the main energy scale in atomic physics, which is the Rydberg 50° mMeC?, we see that fA < Ry. 


In fact, the fine structure scale Ey; ~ a*mec? 


, sO we even have HA < Fy. So this means that resolving the energy 
difference from fine structure effects is not inhibited by the line width! 

We know that the decay rate is only nonzero (to the order that we’re considering) if the matrix element (b|7|a) 
is zero, and there are certain selection rules for when the matrix element between two states |n, 2,m) and |n’, 2’, m’) 


is nonzero. As derived in recitation, we have the following results: 


Proposition 163 


The matrix elements (n’, 2’, m'|r|n, 2, m) are zero unless £’—£ = +1, and in addition (n’, 2’, m’|z|n, 2, m) = 0 unless 


n’ =m, and (n’, 2’, m'|x|n, 2, m) = (n’, 2, m'|y|n, 2, m) = 0 unless m’ — m= +1. 


This concludes our discussion of light-atom interactions and time-dependent perturbation theory, and our next 
topic will be the adiabatic approximation. Here, we continue to study quantum systems with a time-dependent 
Hamiltonian H(t), but we will assume that it varies slowly as a function of time. (This is a setup that often makes 


sense for cold-atom systems, and it is also used for adiabatic quantum computation.) 


Example 164 


The variation of the Hamiltonian may come from an experimentalist changing a parameter in a lab. For example, we 


may have a pendulum in which the length is changing, so that the Hamiltonian looks like H(t) = ie oF $muw(t)?x?. 


We wish to figure out how to quantify the “slowness” of the change of H(t), which comes from the slowness of 
the change of w(t). To do so, we compare to the fundamental time scale of the system T = 2m the period of the 
oscillations. It then makes sense that we have a slowly-varying function if over each period of length 7, the relative 


change in w is small: another way to write this is that 
|T-wl<w. 


Plugging in our expression for T, this can be rewritten as 


d/l dT 
<1 => |—[- J) «<x1 SS |— «Kl |. 
dt \w dt 


W 
W2 


This is a nice result because it’s unitless, and it’s reminiscent of the condition we had in the WKB approximation, 
in which | | < 1. Notice that we're not making any assumption here that the change in the Hamiltonian is small, 
though — over a long time period, H is allowed to change dramatically. So this is different from perturbation theory in 
that dH(t) is not assumed to be small! 

We'll be talking about adiabatic evolution for quantum systems in this class, meaning that we want to understand 
how H's slow variance affects the Schrodinger equation ing |a(t)) = H(t) |W(t)). The following concept is particularly 


useful in this setup: 


Definition 165 


The instantaneous eigenstates and instantaneous energies of H(t) are the wavefunctions |W,(t)) and eigen- 


values E,(t) satisfying 
H(t) |Wn(t)) = En(t) [Wn(t)) . 


We often draw “level spaghetti” diagrams, plotting each E, as a function of time. We've mentioned previously that 


in general, these instaneous eigenstates don't help us with time-dependent Hamiltonians, but we'll see now that they 
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are useful when our Hamiltonian varies slowly in time. But we do need to avoid degeneracies: we will assume that the 
levels do not cross, so that we have 
Fo < Ey < Eo <--: 


and we can choose an orthonormal basis |w,). 


Theorem 166 (Adiabatic theorem) 


If our system is in an instantaneous eigenstate |w,(0)) at t = 0, then 


|sp(t)) © cn(t) [n(t)) 


for all t, as long as H(t) changes slowly. In other words, the probability of transitioning to a different state goes 


to O for a varying-slowly-enough H. 


Proof sketch. We'll denote the full wavefunction solution by W in this proof to avoid confusion. We expand the solution 
|W(t)) in terms of the instantaneous eigenstates (we have completeness of the orthonormal basis at any point in time, 


because H(t) is a Hermitian operator) 


[w(t)) = $5 cn(t) |ba(t)) . 


n 


Plugging this into the time-independent Schrodinger equation, we have 
oO : ; 
me |W(t)) = pe Cn |Wn(t)) + Col Wn(t)) 
by the product rule, and we know that this is equal to 


= H(t) |W(t)) = S> En(t)ca(t) lbn(t)) - 


We can thus manipulate this equality to get an equation for each ¢,(t) by applying a (wx(t)| for some fixed k on both 


sides. By orthonormality, we find that 
iN, + 11S ~ colt) (be |Wn) = Ex(t) cx. 
n 


We'll split this up as 


inex = (Ex(t) — i (We |We)) c(t) - iny- (bk|Wn) Cn(t). 
nZ#k 


We wish to show that the equations for various k decouple, because that means that our system will remain in the 
instantaneous eigenstate |,(t)) if it started off in |w,(0)). So we want to the “transition-causing terms” to be 
negligible, and we'll assume that we can neglect them for now (and return to it at the end of the discussion). Then 
we find that 


c= (—FE R(t) — (ele) oe > ext) =e (—5 [at'Ex(e)) exo (— fat’ (velve)) cx(0) 
Introducing the notation 
(t= — 5 f dvER(t). v(t) =i (eld). w(t) =f dtn(e, 


we find that 
Cx(t) = exp (10x (t)) exp (/¥K(t)) Cx (0). 
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In particular, if c,.(0) = dmx (meaning that our state started in one of the instantaneous eigenstates), then c,(t) = 0 


for all t unless m= k, and thus our wavefunction takes the form (under this approximation) 
Y(t) eB eA lat) 


Here, 6,(t) is known as the dynamical phase, and y(t) is known as the Berry phase or geometric phase. (The 


reason this phase is important is that we may not start in an instantaneous eigenstate — more generally, we'll have 


W(t) = ST Nl el) (0) [a (t)) 
k 


We can verify that y(t) is real (so that we do have a phase), because 


(Vulva) = 1 => Cals) = (ulibe) + (dale) = 0. 


Since the two terms here are complex conjugates of each other, they can only be negatives of each other if they are 
purely imaginary. 

It now remains to examine whether we can indeed drop the transition-causing terms: the terms that are being 
coupled together involve expressions that look like (Wx| Yn) for k # n. Because we have instantaneous eigenstates, 


we know that (taking a derivative) 
H|Wn) = En(t) |Yn) => A ln) +A |v) = En ln) + En |Wn) - 
Applying a bra (1,| to both sides gives us 
(x|A| dn) + (We |H| On) = En De) Yn + En (Wx|Wn) 


and now the first term on the right is zero by orthonormality, and the second terms the two sides combine if we apply 
the H on the left-hand side on the bra to make Ex (Wx|Wn). Thus, we have 


(bk|H) Wn = —(Ex — En) (We| Wn) 


and thus rearranging gives us the identity 


(il H[ Pn) 


(Ibo) = Ey EC | 


So as long as the energies do not cross (so Ex(t) — Ep(t) is not too small), and we have a slowly varying Hamiltonian 
(so that the matrix element is indeed small), the adiabatic theorem will generally hold. Indeed, if we plug in our boxed 
expression back into the Schrodinger equation, we have 


- a = apn Abel Ale) 
(ing — Ext) —Mdalds)) a2) = 9D BO Bray 


We may be concerned that the effects of H will accumulate over large time periods, so that the locally small effects 
on the right-hand side will accumulate. But this does not happen: if we write our coefficients in terms of the phases 


c(t) = e'*() e'%() & (t), we then find that the equation simplifies to 


eit dusts . Vl AlWn) —i0,¢t) i 
ine! (4) eM E(t) = in)? ( @lOn(t) @itn(t) 
nék Ex(t) = E,(t) 
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Dividing out the phase on both sides gives us 


(bk|H|Wn) ei Gn—9k)t gilMn—W)t & 
&(t) = Xe, (H=E A(t) eae 


Integrating this equation from time 0 to t, we find that 


Wk Aly a ee 
&(t) — & (0) = = aay OBOE WEEE), 
nZ#k 


And even if the window of time t Is very large, the first fraction is a slowly-varying function of time, and we're 


integrating against a rapidly-oscillating phase. So most of the integral contribution will be zero, and thus the effects 


of H do not accumulate. 


We'll talk about this in more detail next time! 


37 April 28, 2021 (Recitation) 


We'll start today by finishing our discussion of spontaneous emission, giving a derivation that is more direct for finding 


3 > 
the rate A= 4 82 | dapl?. It'll be a bit beyond the course material, but most of it fits with what we know here. 
Basically, the stimulation of this process comes not from an external field but rather from the quantization of 


photons. Here’s how we'll set that up: 


Example 167 
Consider an initial state |/) = |2,1,m) @ |0), corresponding to a particular eigenstate of the hydrogen atom and 


zero photons, and a final state |f) = |1,0,0) @|k), corresponding to the ground state of hydrogen and a photon 


with momentum ff = hk (and thus frequency w = kc). 


We'll think about having a “raising operator” for the photon 


and also lowering operators which satisfy a, |0) = 0. The point is that we have not just classical electromagnetic fields 


but quantum vector potentials written in terms of operators: we will use the fact that 


1/2 7 = 
A(7) = jk (A ) [anziet™” +alete ik? 


(it makes sense to need both raising and lowering operators so that we have Hermiticity). Here, &, tells us the direction 


of the photon’s polarization and are thus called photon polarization vectors. It's also important during this quantization 


process to explain the commutation relations: we have 
[ax, a}.] = 69(k — K’), 
which is the analog of the Kronecker delta for the continuous situation. So we'll now do our perturbation theory using 


this A: consider the harmonic perturbation 


Ar ty =e“ Ar), 
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so that we get 


a5 A 
6H =-qF-E= 47 og 


iF 2 —— 
Ot’ 
and plugging in the expression for A and using the same long-wavelength approximation we've been making so frequently, 


things simplify to 


We can thus use Fermi’s golden rule on our elt H’ Hamiltonian perturbation, and what we find is that the transition 
rate , 
T 
Wi-+Ff = inn f ok 6(E; — Er — fw) 


where we sum over the final state photon and use the fact that the remaining part of the integral involving the density 
of states is sharply peaked. (Basically, we sum over the final momentum of the photon, but we’re constrained by 


energy conservation to have E¢ = E; — hw.) We then need to evaluate the matrix element 
i IW a] = 
(FIT |i) = = (1,0, 0|a712, 1, m) © (|A(0)|0) 
This second term is the one that is new: notice that we can write |k) in terms of a creation operator, so 


(k|A(0)|0) = (0 au | 2K U2 at et, 0) 


where we've dropped the a, term because the annihilation operators make things vanish. So now we use the fact that 


apal, = alan + 6°(k — k’), and now the first term has matrix element 0 in the ground state, so we can move the ax 


past the al and pull the dk integral out of the matrix element to find that what we're left with is 


h 2 1/2 
= (integral over k) - ( sf ) &. 


4n2w 


From here, we still have to deal with the delta function 
[PKs — Ey — fick). 


The angular part gives us a factor of 47, and the radial part is fixed by the delta function so that we must have 


4m 4m 
= am f akk?6(E; — Fs —hick) = 5c kit = aoswir- 


Specifically, all ws in our formula get replaced by an wj¢, and now that completes our calculation: working out the rest 


of the integrals gives us the answer for the rate A: 


_ far Ve se ol Te 4n 5 
Wi-+F (3) (SE) id SAmue \ nee “2. 


Here, the factor of 3 comes from the averaging over polarizations like we had with the f- d term in lecture — that fi 


plays the same role that € is playing for us right now — and the final factor of 2 comes from the fact that there are 


two spin states for our photon m= +1. Simplifying everything here indeed recovers our desired formula! 


Fact 168 


Basically, the only quantum mechanical considerations going on here come from the raising and lowering operators 


ae a which don’t talk to each other for different k. 
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We'll now turn to the adiabatic approximation mentioned in lecture, in which we have a slowly-varying time variation 
H(t) (meaning that H varies slowly relative to the system's intrinsic time scales). In other words, if there is some 
internal timescale 7; for substantial variation, as well as some external timescale T. affecting time-variation, we must 
have Tj < Te. 

But we can phrase this situation in terms of energies as well — we need to assume that states are nondegenerate 
and that there are no energy crossings, so another phrasing is that the smallest energy gap for any pair of energy 
levels satisfies Amn = ming |Em(t) — Ep(t)| > 0. We can then think of our internal timescale as being T; ~ R, and 
we can think of our external timescale also in terms of H via T, ~ a (an approximation for the time that we spend 
in a narrow energy region A in which levels are closest). So again T; < Te is the approximation, and that puts some 
constraint on the time-varying nature of H: 

A h hlH| 
K 


| i; 
Al ~ a mS 


In lecture, we mentioned the concept of instantaneous eigenstates being useful: treating time t as a parameter, we 
can find the eigenstates and eigenvalues |w,(t)) , E,(t) of the matrix H(t) at any fixed time. We can then use the 
adiabatic theorem, which tells us that if our initial state |V(0)) = |w,(0)) is some instantaneous eigenstate, and H 
is slowly varying with nondegenerate energies, then ||W(t)) = e/"() e%( |ay,(t)) is a phase times the corresponding 
instantaneous eigenstate at some later time. 

However, notice that the nth energy level can be very different at different times — drastic evolutions are allowed, 


but we (approximately) do not make transitions between different energy levels. 


Example 169 
Consider the Hamiltonian H(t) = Ho+f(t)H’, so that the time-dependent part is parameterized by some function 


f which is turned on slowly from 0 to 1 during the time interval [0, T]. 


Then we have some instantaneous eigenstate |w,(0)) at time t = O for Ho, and we have some instantaneous 
eigenstate |w,(7T)) at time t = T for Ho + H’ (where H’ can be as big as Ho, for example). Then the adiabatic 
theorem tells us that as long as we dragged f(t) slowly enough over time, we do indeed evolve approximately from 


one instantaneous eigenstate to another, even if those eigenstates look completely different. 
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Last time, we started talking about the adiabatic approximation: the idea is that we have a Hamiltonian H(t) that 
changes slowly in time (though the overall change between initial and final Hamiltonian can be large). If we define 
instantaneous eigenstates |q,(t)) for the Hamiltonian H(t) at each time t, and it turns out that the instantaneous 
energies E,,(t) do not cross (and that there is no degeneracy), then starting in |w,(0)) at some initial state means that 


@n(t) ei%(t) (And starting in some linear superposition means we 


we will approximately stay in |w,(t)), up to a phase e 
have a linear superposition of the final e/*) e'(®) |a,(t)) states.) 

We started talking about the justification for this result last time, and we'll continue that here. Basically, we decom- 
pose the full wavefunction into the instantaneous eigenstates with some time-changing coefficients En(t)elOn() el nlt) 


What we need to show is that ¢,(t) is approximately constant in time under the assumptions of the adiabatic theorem: 
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we calculated the difference ¢(t) — ¢(0) to be the exact expression 


ee dt! (del Al dn) ell tn—te)t! giln—8u)t! & (4!) 
_ / ‘ 

La fo © (Ex — En)(t) 

To work with this, we need to understand how the different “degrees of slowness” work in this expression. If the 

evolution initially takes place between times 0 and 7, but then we slow down time (stretching out the time axis) so 


that T gets larger, we get a family of Hamiltonians 


H(t,T)=Ai(s= =). 


ae we parameterize s € [0,1]. So the evolution becomes slower as T — oo — indeed, the chain rule tells us that 
Tz ~ F- SO that means the matrix element of H gets smaller, but the window of integration gets larger — this means 
we oi find a way to show that the integral is nevertheless not of order 1. 


The point is that the dynamical phase @,, which ts basically an integral of the instantaneous energies, satisfies 


: ; 1 
6, _ Ox — ~Z (En —_ Ey Ce); 


and because the energies change slowly over time, locally we have 6, — 0, & #(En E,)t’. So the phase factor 
oscillates locally with a frequency determined by the instantaneous energy difference (and notably, not dependent 
on T). Since the rest of the integral for ¢,(t) — ¢(0) (the Berry phase and the matrix element) is slowly-varying in 
time, the @, rapid oscillation will wash out most of the integral to 0. This means that we only get contributions from 
the incomplete periods at the beginning and end, in which the integrand is proportional to ae (since €, is a coefficient 
whose square gives a probability, it must be at most 1). Since the duration of the incomplete period is roughly ge, 


that means that we get the estimate 
_ 4 hH 
x (T) — €(0) © (AE)? 


This is of order a so the difference does indeed go to 0, and we've shown the adiabatic theorem. (However, we do 


need to notice that this result depends on AE being strictly positive — things break down if the energy levels get too 


close to each other.) And as an additional result, the probability of a non-adiabatic transition is 


Remark 170. Our argument is pretty handwavy here (especially with the “washing out of the integral”), but it can indeed 
be made into a more formal proof. And we can improve the error bound here as well: if our system evolves between 
an initial and final Hamiltonian between times 0 and T, and the ‘turning on" and “turning off” of our Hamiltonian at 


time 0 and T is very smooth, meaning that the first p derivatives of H vanish, then 


1 
Pani) $0 ( ape) 


We'll now consider a toy setup that lets us explore the adiabatic regime and also the corrections to it: 


Example 171 (Landau-Zener problem) 


Suppose we have a system in which our Hamiltonian is a function of some parameter R, and suppose that two 


of the levels E,(R) and E_(R) come close to each other for a short period of time but don’t quite cross. (Such 


“avoided crossings” are common.) 


Before we discuss this particular problem, we'll first explain why we try to avoid level crossings in quantum mechanics. 
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If we have two energy levels, we can always write the Hamiltonian for those two levels in terms of the Pauli matrices: 


H = aol +4-G, where ao, a1, a2, a3 € R. The energy levels are then a9 + |a], where 7 = (a1, a2, a3), and thus the 
energy difference E, — E_ = 2|a] goes to zero only when 4 = (0,0, 0). In other words, we need to tune three different 
parameters to get a crossing, and thus crossings do not occur generically. 

But there is an exception to this logic, which occurs when two levels have different quantum numbers. For example, 
if [L?, H] = 0 (the Hamiltonian commutes with angular momentum), and there are two energy levels with different és 
|2) , |@’), then we can guarantee the matrix element (£|H|é’) = 0. This basically guarantees that a, and ap will be zero, 
and thus we only need to tune one parameter to get the energy levels to coincide. 

So now we'll return to our problem: we tune our parameter R, and we want to ask about the probability of making 
a transition between instantaneous eigenstates over time. The system that Landau and Zener had in mind was to 
have two atoms (such as Na and Cl) which can form a polar molecule. Specifically, on their own, they are neutral 
atoms, but when they get close together, we should think of them as Na+ and a Cl- ions. Our parameter R will then 
be the separation between the two atoms — we'll then have the ionic state (Na+, Cl-) and the atomic state (Na, 
Cl), and we know that these levels will come close at some distance, because at a close enough distance it becomes 
more favorable to see an ionic state than an atomic state. Since we mentioned that energy levels should not cross, 


the energy diagram will look something like this: 


What we'll imagine is that a sodium atom is scattered at a chlorine atom, and we want to calculate the probability 
that we make the non-adiabatic transition to the ionic state. In particular, if we move R very slowly, then we will 
follow the bottom branch of this energy diagram, so that as the sodium atom is scattered it will transition to the ionic 
state and then back to the atomic state. But there is some probability of an energy transition near the near-crossing, 
meaning that we jump to the top branch and exit in the ionic state. 

For a process like this, it makes sense to have R vary as a function of time, and the highest probability occurs near 


the near-crossing point. 


Example 172 
But we'll instead do an even simpler problem: we ignore all energy levels except these two, and we start with a 


Hamiltonian 


(We'll soon add an additional parameter into this expression.) 


0 
The instantaneous eigenstates are always |1) = F 0| and |2) = ] , with energies +2 so that these energies 


intersect at time 0. We now have a true energy crossing, and we can solve the time-evolution exactly: the time- 
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dependent Schrodinger equation gives us 


nt) =e0(—1 f ex(e)ae’) 11) =e (—8E) i 
1 = exp A Jo 1 = €xp ane , 
and similarly the time-evolution for the other state is 
iat? 
fda(t)) = exp (BE) 2). 


But we can make the problem more interesting by adjusting our Hamiltonian so that we avoid the level crossing: 


suppose that we introduce a constant off-diagonal term 


H12 
H(t) = 7 at|’ 
12 ~ 2 


where to be concrete we'll make Hj2 > 0 a real number. We then have H(t) = Sg? + Hy20%, and the eigenvalues 


of this Hamiltonian are + = + H%,. Our energy diagram now looks as below (blue is with Hi equal to zero, and 


purple is with Hj. nonzero): 


> t 


The interesting point here is that our system's instantaneous eigenstates change from those of a7 to those of o* 
and back as t goes from —oo to 0 to oo — along the purple paths, the spin rotates from +z to —x to —z, or from —z 
to +x to +z. (We actually switch between the two levels on a given branch!) So we'll phrase our question as follows: 
if we start in the lower branch at time t = —oo, what is the probability of a non-adiabatic transition — that is, what 
is the probability that we end up in the higher branch at time t = co? 

The characteristic timescale Ty over which the instantaneous eigenstates are changing considerably can be estimated 
by looking at the horizontal dashed line above and letting its length be 2Tg. We then have 


Hi2 
mae 


. »2Tq = Ayo Tq 


For the adiabatic theorem to apply, this time must be much larger than the intrinsic timescale of the quantum problem, 
which is determined by the energy difference between the states: this intrinsic timescale is largest when the levels are 


closest, so 
fi fi fi 


Tio & ro ~ i 
a he Bi. Ags 


2 
So the adiabatic theorem applies if Tg > Ti2 => Ae >> 1. (And this is in fact the only dimensionless parameter in 


136 


the problem.) In this case, we don’t actually need the adiabatic theorem to find the probability of a transition, which 


can be found analytically to be 
2 
Pron-adiabatic = XP (-2n52 ) . 


So this transition probability is exponentially small in a@ — basically, if a@ is small, we stay in a given branch, but if a@ is 
large, we will basically follow the blue lines and switch to the other branch with high probability. (This is confusing, 
though, because staying in a branch means we will transition from spin-up to spin-down! That's happening because 
the instantaneous eigenstates change drastically over the evolution.) 


From here, we'll move on to a discussion of the Berry phase (named after Sir Michael Berry) 


L 
tami ff at! (Vole). 


The role of this phase is particularly important in various applications like condensed matter physics and quantum 
magnetism, and we'll understand it now. Suppose that our Hamiltonian is a function of some number of parameters 


H(R) = H(R;1, Ro,--- , Ry): for each value of the parameters, we have some eigenstates satisfying 
H(R) |wn(R)) = En(R) [abn(R)) . 
If R is a function of time, we then get a Hamiltonian H(R(t)), and we then have instantaneous eigenstates 


H(R(t)) [n(R(E))) = En(R(t)) [vn R(t))) 


Proposition 173 


The Berry phase only depends on the geometry of the path R(t) in this parameter space (and that’s why it’s also 


called the geometric phase). 


Proof. We know that 


Yn(t) - | dt'y,(t’), 


where we write in the R dependence 


un(t) = i (bal Rt)| Se bl) 


Taking the time-derivative by the chain rule, we find that 


0 dRn 
dt 


d SB J dRi | | = Vi 5 . dR 
Fz [Pal RCED)) = pe Mn) St + Sen) S™ = Ve [Wal R(E))) FE 


Plugging in, we then have ; 
. 3 = = dR 
Un(t) = i (Yn(R(t))|Valwo( R(t))) - 
so that the Berry phase is 
j : / D/ +! = Sal dR 
alt) =i ff de dal R(e))| Pa dn( REED) » Se 


If we say that R(t) travels along a path T from R; to Rr, then we're doing a line integral 


=i f (bolR)|Velvn(R)) - a8 
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This indeed doesn't depend on the parameterization of the path, so the Berry phase is indeed only dependent on the 


geometry, as desired. 


The integrand in our proof above, 
An(R) = i (dnl R)|Va|vn(R)) 
is known as the Berry connection, and the Berry phase Is then the integral of the Berry connection over the path 


.. We'll discuss more about the properties of this Berry phase, including its gauge dependence for the instantaneous 


eigenstates, next time! 
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We'll discuss the adiabatic approximation today, doing a few examples. Recall that we can make this approximation 
when we have a separation of timescales T; < Te (meaning that the time for a significant change in the external 
Hamiltonian is much longer than the timescales internal to the system). In this situation, we can write our wavefunction 


in terms of the instantaneous eigenstates 
[W(t)) = eleltl) lep,(t)) , 


where |w,(t)) is the eigenstate of H(t) at time t (so we treat time essentially as a parameter) with corresponding energy 
E,(t). We then have the dynamic phase 6,(t) = —# : dt'E,(t') and the geometric phase y,(t) = i dt'v,(t’), 
where Yn = i (tn|Wn) (which is real by properties of the inner product). 


Example 174 


Consider an infinite square well with walls at x = 0, a, and suppose that a changes in time as w(t) = a+ vt (so 


our right wall is moving to the right, making our well grow wider). 


We know the energies and eigenstates of an infinite square well — at all times t, we must have w(x = 0,t) = 
w(x = w(t), t) = 0 (that is, the wavefunction vanishes at the two walls). 
We can turn this into an adiabatic problem, asking when the approximation is valid and seeing what solution we 


end up with. For simplicity, assume that we start in one of the instantaneous eigenstates at t = 0, namely 


2. /nnx nem he 
Halos 0)= y/2n (4), En = mae 


More generally, the instantaneous nth level eigenstate is 


(2. 7 nnx nem h? 
wne(x, t) = wo a EN ee 


(since all that’s changed is the width of the well). To figure out the relevant internal timescales, notice that the 


difference between energy levels can be bounded as 


wh? 
En-—Em> b= mae 
(treating w as order a), which gives us an energy scale of 
h h 2 
AE & h 
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Then the external time is the amount of time it takes to make a significant change in the width, which is Te ~ 2. So 


for our approximation to be valid, we will want to have 


2 
a ma fh 
Te.>7j => ->— | |v«K« — |. 

V fi ma 


But once we make this assumption, we can notice that the energy levels all change in the same manner and thus there 
are no energy crossings. This nondegeneracy then allows us to make our adiabatic approximation, and our answer now 


depends on finding the phase factors. We have 


oa ie wh fae oh | i - 
6,(t) = -; | qreAr) = 5 = : : 
A Jo 2m Jo (atvt') 2m voatvt'l, 
which we can further simplify to 
mnnrif(1 o1l\  wnn? vt —| wAn? t a 
2m v\a w 2mv aw(t) 2ma2 fiw(t) | 
So now notice that mh is the energy E,(0) of the original eigenstate, so multiplying by —; gives us the usual 


time-evolution phase! The adiabatic approximation thus gives us an extra we term for that phase. We can also 


calculate that the y,(t) phase turns out to be | 0 |, because the instantaneous eigenstates are all real, and thus the 


time-derivatives are also real. That means (n|Wn) is real, but we also showed it is in general purely imaginary, so it 


must be zero. So plugging everything back in, we get the wavefunction 


Vos) = WiC teen] EHO 2 


fh w(t) 


for a slowly-moving wall. 


Example 175 


We can actually compare this answer to the exact answer to figure out the accuracy of our adiabatic approximation. 


Basically, if we solve Schrodinger’s equation inZu(x, t)= —F Sux, t), we get the relatively simple solution 


(so we just gain an extra exponential factor at the end). We can then think about superpositions of these solutions, 


and we want to ask whether it makes sense to write 
ese eo Cana) 
n 
It turns out the answer is yes, because the WV,,s are orthogonal — for example, at time t = 0, we can notice that 


(Vn(0)|Wp(0)) = i dt, (x, 0)Wa(x, 0) = Sm 


because the extra phase term cancels between the bra and the ket. If we now think about starting in the ground state 


W(x, 0) = [2s (=) , 


then we are not in the ground state of the general Hamiltonian, and instead we can compute the coefficients in our 


of the original well 
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Vm basis to find 


: 2 [? , .. fAX\ 2. (mmx imvx? 
n= f dxW (x, 0)W* (x, 0) = =| dxsin ( - ) sin ( - ) ex ( Pha 


mva 
Qn7h 


SG _ ; 
Making a change of variables z = 7x and letting a = gives us 


o) Tv 
Cm = = | dzsin(z) sin(mz) exp (—iaz?) . 
0 


Noticing that a is the term that we assumed was small in the adiabatic approximation v< a we can Taylor expand 
it to 1 to find 


2 Tv 
Ca al dzsin(z) sin(mz), 
T Jo 


so that Cm = dm1. In other words, in this limit, we do indeed follow the m= 1 level: there are no transitions because 
the extra factor is negligible, and thus we just have W(x, t) = wiPs*t(x, t)e/ like before. 


We'll finish with another example: 


Example 176 


Suppose we have a magnetic field which is oscillatory (instead of linear) so that 


B(t) = Bo(sinacos(wt), sinasin(wt), cos a). 


In other words, we have a magnetic field that rotates around the z-axis, and we want to understand what happens 


if we have a Hamiltonian 


on a two-state system with spin 5. We can write this Hamiltonian as a matrix 


—iwt 


fiw cos a e sina@ 
G9 a aera | 
2 Jeliwt]sina —cosa 
where W, = — is a characteristic frequency of precession. The eigenvalues of such a matrix are then Es = Pe and 


then w, Ww can be thought of as the external and internal frequencies, respectively. So in the adiabatic limit, we want 


Te >> T, = |w<w) 


(since the characteristic timescale is the reciprocal of the characteristic frequency). Our instantaneous eigenstates are 
then 
cos & sin = 
2 2 
X(t) = ie | x(t) = ee ‘ | 
el? sin S —e cos 5 
with (time-independent in this case) energies E,,E_, respectively. So we will apply the adiabatic approximation: 


suppose at time t = O, we're in the x4 state, which is corresponding to spin up along the direction of B. Since 


E, > E_ (there are no energy crossings), the adiabatic theorem tells us that 
w(t) = el Me [Ly (2), 


Because the interpretation of this x, state is as the spin-up state in the direction of the magnetic field, what the 


adiabatic theorem tells us is that for a slowly-varying magnetic field, our spin will always align with it. 
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We've been discussing the adiabatic theorem recently, understanding for a slowly-varying Hamiltonian, our system 
stays in the instantaneous eigenstates up to a calculable phase (which is different for different energy levels, so this 
phase becomes important when we deal with superpositions). Last time, we started discussing Berry’s phase 7, = 
ae dt’ (nln) — we found that the phase can be thought of as a line integral in parameter space R= (Ri,---, Rw), 


in such a way that the Berry phase depends on the path taken but not the parameterization: 
Vn = [ dR ALR, 
: 
where we have the Berry connection 


A,(R) = i (bal R)|V 


Vol R)) 


In contrast, the dynamical phase @,(t) does depend on the details of how we evolve along the parameter path [: for 
example, if E,(t’) = E, is constant along the evolution, then 6,(t) is proportional to t, so it does depend on how 
quickly we traverse our path. 

We'll focus more on the Berry connection now: notice that the states wr(R) (the eigenstates for the Hamiltnoians 


H,(R)) are only defined up to a phase, so we can define 
lwi(R)) =e Jan R)) 
where B(R) is real. Then we notice that 


A(R) =i (V(A)|Ve 


Wh(R)) = i (nC R) eV ge-* | a(R) ) 


(we can always take R out of the inner product because it’s basically a constant parameter, not like the position 


operator). And now we can simplify this by applying the gradient on the two terms to the right: 


=i (Yn R)V x 


Un(R)) + V8 (Yo(R)] bn). 
Since (vol R)| a(R) = 1, what this tells us is that 
A(R) = A,(R) + V 96. 
This looks a little bit like a gauge transformation, and this changes our Berry phase to 
tha [dR A(R) = [dR (An) +956), 


and now the first term is the ordinary Berry phase, and the second term is the line integral of a gradient, which 
simplifies to the difference at the endpoints 6(R-) — B(R)): 


Yh = In + B(Re) — BCR) | 


So this is not a gauge invariant quantity for an open path [, but that’s not actually a problem — it turns out that 
combining the changes to the Berry phase and the wavefunction wn(R) will give uS a consistent answer for the 
evolution of our system (the changes cancel out). And there's also a special case where [ is closed, meaning that the 


difference B(R-) —B(R;) vanishes, and thus ¥, is a gauge invariant quantity for closed paths (this makes sense because 
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we can really only compare the initial and final phases meaningfully if they are relative to the same Hamiltonian H(R)). 


Here's a few other points that are important for us to keep in mind: 


« As mentioned in recitation, if w,(t) is a real wavefunction (in position space), then y, = 0, because 


. d 1d d d 
Y(t) = i (nl Vn) = if Prabal t) a val? t)= if ore (wil, t)?) = is, | orvRr, t) = pre = 0. 


This is not surprising, because if W, Is real, i (n\n) is also manifestly pure imaginary, but we've defined it to 


always be real. 


As a special case of the closed loop T that we considered above, if our loop is trivial in that it travels along 
some path [; and then retraces that path backwards, then the Berry phases will always cancel and y, = 0. In 
particular, this is always the case when our parameter space is one-dimensional and we have the same initial and 


final value of R. 


Finally, suppose we have a three-dimensional parameter space, and we want to calculate the Berry phase along 


a closed loop 
Yn = p A(R) dR. 
: 


By Stokes’ Theorem, we can write this line integral as 
= [ (x AA) -d5 = [ 5, -45 
iS s 


for any surface S with boundary 0S = [, and we have the integrand Dr =Vx A,(R) is called the Berry 
curvature. Notably, this Berry curvature is gauge invariant under the transformations An Ant VaG that we 
just discussed, and we can think of this D, as serving a similar role as the magnetic field and how it relates to 


the vector potential! 


Remark 177. /f we think about electromagnetism in spacetime, the vector and scalar potential are unified into a single 
four-vector A, = (, A), with transformations Ay, + Ay +0, and field strength (including both electric and magnetic 
field) Fup = O,A, —O,Ay. So if we generalize the previous discussion to more than three dimensions, we can construct 


a more general two-component antisymmetric tensor in the place of D,(R). 


Example 178 


= 


, so that the Hamiltonian is 


Consider a spin 1/2 electron in a magnetic field B = Bofi for some unit vector fi 


H=-f- B with electron magnetic moment ff = —gig- 3 & —Ueo. In other words, we will use the Hamiltonian 


H(A) = (ue Bo): & with parameter 77 living on the unit sphere. 


We know that the eigenstates for each direction 7 are the two states |f7, +) with eigenvalues +g Bo (since (7: G) 


has eigenvalues +1). If we want to let 7 change direction in time, slowly enough so that the adiabatic theorem will apply, 


fh 
HpBo* 


But if we're in this regime, we have a slowly-rotating magnetic field, and we can calculate the Berry phase that is 


we need the characteristic time of variance in f to be much larger than the quantum timescales tquant = at ~ 


picked up along a closed path. If we follow the |77,+) state, we have 
V+ = [A -dR, 


and (parameterizing with polar coordinates, left as an exercise) we find that the total Berry phase will evaluate to 


—$Q, where ( is the solid angle subtended by the path (specifically, the solid angle of the region that we traveled 
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counterclockwise around). And for a more general spin S (instead of a spin 1/2 particle), we know that (S - 7) has 
eigenstates running from —s to s in steps of 1. If we're in the top state with eigenvalue s, the Berry phase will then 
turn out to be —sQ. 

Suppose now that we have a family of paths that run along the “parallels” or “latitudes” of the sphere, and consider 


the one at angle 0. The solid angle subtended is then 
Q(0) = 2m(1 — cos @) 


(we can check that this works for 6 = 0, 5,7). In particular, if we take a path exactly at the south pole, we pick up a 


Berry phase —4mS, but because such a path is actually staying at a fixed point, no phase should be picked up! Since 
our phase is only defined modulo 27, this tells us that 


—4nS €2nZ => 2S €Z, 


and thus we've found an argument for why our spin needs to be a half-integer (which is indeed the case!). 


Example 179 


We'll finish the adiabatic approximation discussion by thinking about molecules and the Born-Oppenheimer 


approximation, and our central question here is to determine the spectrum of molecules. 


We already know that atoms with atomic number Z > 1 are already difficult to deal with, because of factors 
like the Coulomb repulsion between electrons, and the analysis of molecules becomes even harder. For example, if 
we consider H», even fixing the two positions of the nuclei and only considering how the two electrons behave, we 
lose rotational symmetry except along the axis connecting the two nuclei, and that makes studying the system a lot 
harder. 

Furthermore, it isn’t clear why we can fix the positions of the two nuclei — the distance may change, so vibrational 
motion factors seem to play a role. But we can handle this part of the problem, because nuclei are much heavier 
than electrons (typically the ratio is about 10~*). So the nuclei move much slower than the electrons do, and thus 
we can apply the adiabatic theorem, freezing the separation distance R between the nuclei. This then gives us some 
nuclear potential Vy(R) for configuration of the nuclei, and this potential will generally have an equilibrium separation 
(because at large separations we have a behavior due to van der Waals, but for small separations we have very large 
V due to Coulomb repulsion). So we can then consider small vibrations along this equilibrium separation, and we can 
say that the electrons adiabatically adjust to the “instantaneous eigenstates” of the nuclear evolution. 

More quantitatively, the typical energy scale for electronic excitations is Ry ~ me (where m is the electron mass), 
and similarly the typical energy scale for nuclear vibrations in the potential Vy(R) is also the Rydberg. Since the length 
scale for the nuclear vibrations is a9, we can estimate the spring constant via 

pe [E). tei, 


Ape? eee knw = 
N= 9M 72 = Le a5 mag 


This then tells us the frequency of vibrations 


7 js [m 
_= aaa M mae’ 


In other words, the energies associated with the vibration, fwy, are on the order of ‘le 


oa so the vibrational energy is 


smaller than the electronic energy scale by a factor of (a = 10-2. So the vibrational frequencies are indeed smaller 
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than the electronic frequency scales, and the vibrational times are thus sufficiently large and the adibatic approximation 
works. 

In addition to the effects due to changing the equilibrium separation between the nuclei, we also need to think 
about rotations of the two-nuclei system as a solid body: in other words, we want to estimate the timescales associated 
with molecular rotation. Since the rotational energy can be written as E ~ Lo for moment of inertia /, we can write 


this as a ~ Rae So the rotational energy is suppressed by a factor of 7 ~ 10-*, and the main point of these 
{0} 0 


ma 
m m 
Eelectronic : E vibrational : Exotational =1: hoe RA A 
MM 


We can in fact compute the effective potentials and solve for the nuclear motion more explicitly, but these are the key 


calculations is that 


points of the Born-Oppenheimer approximation (which we can read more about on our own). 

We're now ready to move to our next topic of the class, scattering, which is relevant to a variety of fields of 
current physics research. One example of scattering comes from taking a beam of electrons or X-rays and shooting 
them at a target crystal, looking at the distribution of the outgoing directions and energies to study the crystal and 
electronic structure. And another one comes from taking two protons (or other particles) and smashing them together 
in the Large Hadron Collider, looking at what particles come out and at what angles and energies to understand 
what different particle processes are possible. Scattering processes determine properties of what's around us (oxygen 
molecules interacting in a gas, or electrons scattering in a material), and we'll start exploring this in the coming 


lectures! 
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Today, we'll do a nice application of the adiabatic theorem, the solar neutrino problem. In the 1960s, it was found 
that there was a deficit of electron neutrinos produced by the sun compared to what was expected by models, and this 
problem was only resolved in 2002. So we'll describe the story here and understand how the adiabatic approximation 


plays into the solution. 


Fact 180 


During solar fusion, neutrinos are produced via a process taking an electron and a proton to a neutron and an 


electron neutrino (in symbols, e~ p — nve), and this is a weak interaction. 


In practice, the lowest energy neutrino comes from the process 4e— + 4p > *He + 2ve (where each helium atom 
has two protons, two neutrons, and two electrons), and another process comes from 8B+e- > ®Be+ve. And what 
was found was that the energy of the neutrino (which depends on which process it’s produced by) affected the extent 
to which the neutrino output deviated compared to what was expected. 

In the SNO experiment, a big balloon of D2O (replacing hydrogen with deuterium) was put far underground. 
Because neutrino oscillations between different kinds of neutrinos (Ye, ¥,,¥,) can happen — for example, electron 
neutrinos can scatter off deuterons and cause the process ve +d — pp+e™7 — the power of this experiment was that it 
could keep track of the total flux of neutrinos. It turns out that the total flux, 5.1 x 10°cm~?s~!, was pretty consistent 
with what was expected, and Z of the electron neutrinos ve born in the sun changed into the 1, or v, types. 

So to account for the measured deficit from the sun, there were two explanations given. The first one, “just so,” 
is that the electron neutrinos oscillate into the other types as they travel between the sun and the earth. What this 


would predict is that the oscillation depends on the distance traveled and the energy of the neutrino, and the idea was 
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that the ratio of those quantities was just right to line up with the mass difference of the neutrinos. But this one was 
ruled out by another experiment, and instead the explanation turned out to be that the electron neutrinos oscillate 


into other types of neutrinos inside the sun. 


Example 181 


To study the “just so” explanation, we'll consider a two-state system {|Ve) , |¥,)}, which we can also describe with 


basis {|vH) ,|vz)} (“heavy” and “light”), the energy eigenstates. 


We can thus relate the two sets of bases via 


|¥L) = (cos@) |ve) — (sin) |yu), |v) = (sin @) |Ye) + (cos @) |Yu) , 


cos? —sin@ 
where @ is the mixing angle — we can alternatively write this in terms of the change-of-basis matrix U = | : 


sin@  cos@ 
Mathematically, just-so oscillations (our first explanation) would take place in vacuum, and they can be explained as 


follows: we know that our energy eigenstates satisfy 
_d a d 
in |i) = Ew |v) j ine |v.) = Ex; |.) ; 


and because the masses of neutrinos here are much smaller than the other energies in the problem, we actually have 


an ultrarelativistic problem: the energy E = ,/p2c? + m?c* can be expanded as 
a2 2.2 
myc mec 
E _ A ea E = L ae ; 
4 \VH) (oc + She + ) len. L |VH) (oc + ane - ) i) 


We'll define these two energies to be Ey = Eg + Ap and Ey, = Eo — Ao, and we can explicitly calculate that 


Am? c* 7 Am? c4 
Apc — 4B 


Fo spc, Ag= 


So in two-component notation, we can describe the light and heavy components of our state as 
‘ d jay E, — Ao 0 aL 
ih— = ; 
dt ay 0 Eo + Ao} | ay 


a a 
but because we're interested in the two different types of neutrinos, we need to change to the ] =U i basis. 
au aH 


Eo—A 0 
pee) =u 
dt au 0 Eo + Ao 
and doing out the matrix multiplication gives us 


_ | Eo — Ao cos(2@) —Ap sin(26) ae 
lhe dee sin(20) Eo + Ao cos(26) 


We therefore get 


Au 


This blue term is our Hamiltonian, and we have shown earlier in this class that if we're in the A state at time t=0 


(that is, we have an electron neutrino), then we can calculate the probability of being an electron neutrino at some 
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later distance from the sun to be (assuming that x = ct, traveling near the speed of light) 


w (t = =) = 1—sin?(26) sin? (=) : 


So the dependence indeed depends on the value of EB since there is a 5 dependence in Ao. In other words, if vacuum 


P(ve, x) = (ve 


oscillations were the reason we saw less electron neutrinos than expected, then we'd need to have exactly the right 


value of & so that the sine term was at the right phase. 


Example 182 
We can now turn to the other explanation, which is oscillation of neutrinos inside the sun. The main difference is 
that the sun is full of electrons and has some density p(x) of electrons per unit volume, so electron neutrinos can 


scatter off electrons and change the Hamiltonian: 


Hsun = Fac + 


V2Grp(x)(hc)? 0 
0) o| 


where /2Gpr & ance is the Fermi constant. 


This extra term means that the Hamiltonian now looks like 


5 (x)(fc)° — Ag cos(26) —Apo sin(2@) 


Hsun = (constant)! + , 
sun = ( ) — Ap sin(26) — Feolx)(Nc)? + Ao cos(26) 


and we can see some connections to the Hamiltonian H = 


t A 
i | from lecture: recall that in that problem, the 
yt 


adiabatic approximation allowed the spin-up state to slowly rotate into the spin-down state. That’s similar to what’s 


going on here: 


|Ve) Vin) 


a 


[Vu) 2 |Ve) 


Basically, x (the distance from the center of the sun) plays the role of t, the —Apsin(26) takes the role of the 
off-diagonal term A, and the diagonal term in our Hamiltonian Heyn takes the role of yt (though it’s not a perfect 
analogy, since p(x) is not linear in x). So then the “time-evolution” of our system is that the center of the sun is on 
the left side of the picture (very negative x) and the surface of the sun is on the right (very positive x), so the y-axis 
in our diagram above corersponds to the “balancing point” x = x- where 

Gr 


alc) lx) = Ap cos(28), 
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meaning that the diagonal terms of the Hamiltonian vanish. If we want to use the adiabatic approximation, we need the 
off-diagonal term to be much larger than the diagonal term when the energies get close to each other, meaning that 
(instead of a time-derivative we have a spatial-derivative, since x is playing the role of t) the adiabatic approximation 
is valid when (constraint (1)) 


Aj sin?(26) >> n(ne)® Fe al). 


(This formula is the analog of A? >> fH.) And furthermore, we also need (constraint (2)) 


Gr 
V2 


so that the x = x- coordinate is actually passed through by the neutrinos (and thus we can actually follow the paths 


(hc)? p(x = 0) > Ao cos(26), 


in the diagram above to switch |ve) into |v,,)). And it turns out that the parameters in nature work out so that we 
are in a regime where (1) and (2) are basically satisfied (the actual value of Ag is large enough that the adiabatic 
approximation is valid, and it is small enough to basically be on the boundary of constraint (2)! So in summary, the 


density of electrons allow electron neutrinos to change into muon neutrinos because of this adiabatic evolution. 


42 May 6, 2021 


Last time, we briefly started talking about scattering in the context of collisions of particles. More than two particles 
can be created when two particles collide, and those products don’t need to be the same type as the incoming particles, 
but when the initial and final products are the same, we call the process a scattering process. 

Furthermore, we have elastic scattering when the internal state of the particles does not change. (In contrast, an 
example of an inelastic scattering process is where an electron Is shot at a hydrogen atom, causing the other electron 
inside the hydrogen atom to jump to an excited state.) We'll focus on elastic scattering here, and we'll make a few 
additional assumptions as well: we'll assume our particles are nonrelativistic, we'll ignore spin, and we'll assume that 
in a two-particle scattering with particles at positions fF, %, the interaction between them is via a potential V(7A — 4) 
only depend on the distance between them. 

We can then switch to a center-of-mass reference frame, where we have a single particle scattering off of a 


potential 
=) 


p 5 

Vir) = — + V(r), 

(N=5, tn 
where tu Is the reduced mass. It turns out that even with these simplifications, we'll have a lot of calculations to do! 
Since this problem is (strictly speaking) time-dependent, we can reduce it to the time-independent Schrodinger 


equation with appropriate boundary conditions. 


Example 183 


We'll first recall how we make this kind of reduction from the time-dependent to time-independent Schrodinger 


equation in one dimension. 


The setup is that we have some potential V (imagine a mountain, as we would see in scattering), and we have 
an initial state (at t + —oo) given by a wave packet moving to the right. Our final state then has a transmitted 
wave packet (moving to the right) and a reflected wave packet (moving to the left) at t + oo. And to figure out the 
behavior of this system, we can solve the time-independent Schrodinger equation 

he d? hk? 
( +V()) b= Ev, E = — 


2m dx 2m ' 
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where E depends on the initial wavepacket, with the scattering boundary conditions 
w(x) = Ae’ + Be’ as x — —00, W(x) = Fe'* as x + 00. 


Then we claim that 7 = [517 is the transmission probability (this is the probability that our particle is found on the 
right of the mountain potential), and R = |Z is the reflection probability (this is the probability that our particle has 


been reflected back to the left of our mountain potential). 


Example 184 


We now generalize to three dimensions, supposing that we have a potential V(r) localized near the origin going 


to 0 as r > co sufficiently quickly. Suppose that at t + —oo, we have an incoming wavepacket incoming, moving 


in the +z direction. 


We can then describe the final state as t + oo as a scattered wavepacket, which will basically be a spherical 
outgoing wave, plus an unscattered wavepacket moving in the +z direction (corresponding to the particle missing 
the potential). We now want to turn this into a time-independent problem, and we can do so by solving the three- 


dimensional time-independent Schrodinger equation 


; 
(- FeV? +V9) 007) = EWE. 


We'll assume that V(r) — 0 vanishes faster than 2 as r —> co — In fact, often we will assume for simplicity that 


2k? 


V(r) = 0 for r > a. Since our energy can be written as E = =~, our Schrodinger equation becomes 


2 
( Pv? + k?) 4 vi) w(F) =0, 


and for a fixed energy (in other words, a fixed k) this equation has infinitely many solutions. (For example, when 
V(r) = 0, our solutions are the plane waves (7) = eK? where |k| is fixed but the direction can be arbitrary.) So we 
need to impose some boundary conditions, and motivated by the one-dimensional discussion, our incoming wave should 
be w(F) = e/k?. But our scattered wave cannot be w,(7) = e!/*’, because that isn’t a solution to the free-particle 
Schrodinger equation (remember that our scattered waves are supposed to be the asymptotic forms, so far away from 
the potential they should behave like free particles). Instead, we make a small tweak: it turns out that _ does solve 
the Schrodinger equation for r #0. (Here, r and k are just the magnitudes of 7 and k, so they are scalars. ) 

Noticing that this solution has no angular dependence, which is bad because we know that the scattering process 
should have some dependence on the angle, we make the general ansatz 


elkr 


W(P)h(8,) = 


p 


So putting this all together, we'll thus look for a solution to the Schrodinger equation satisfying 


elkr 


p(7) =e" + (8,6) 


as f + OO}. 


r 


Here, f,(@, 6) is known as the scattering amplitude (it plays a similar role to F in the one-dimensional case), having 


dimensions of length, and it will be determined by the details of the potential. 


Remark 185. Notice that in one dimension, the space away from the potential is disconnected, but in three dimensions, 
the space is connected (we basically have R? minus a ball). So we can't have different expressions to the left and to 


the right and use different coefficients A and B like we did in one dimension! 
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Remark 186. We're really guessing the form of the solution here — if we want to see the work of how the reduction, 


we can consult Messiah chapter 10 or Merzbacher chapter 13. 


In physical experiments, what is actually measured in a scattering experiment is the differential cross-section. 
Basically, we set up a beam of particles moving towards the target (of potential V(7)), and then there is a detector 
far away from the target (where the potential is basically 0), which counts the number of particles which hit it. If the 


detector subtends an angle dQ, then a steady beam of incoming particles corresponds to a steady beam hitting the 


dNout 
dt '' 


normalized quantity, we divide this by se which is the flux of incoming particles. This gives us the differential 


detector. So we can compute which is the number of particles hitting dQ per unit time. But to get a more 


cross-section 
dN“ /dt 


~ dN'"/(dt dA) 


This has dimensions of area, and it will be proportional to dQ. 


do 


Remark 187. Classically, the reason for the name of “differential cross-section” is that if we have a bunch of classical 
particles moving (deterministically) from the beam through the potential, then a given solid angle dQ can be traced 


back to a particular region in the original beam, which will generally come from some area element do. 


But we want to relate this differential cross-section back to our Schrodinger equation, and we do so by noting that 
the numerator and denominator only depend on behavior far away from the potential, so we can use our boundary 
ikz 


conditions. First, we calculate the denominator: since the incident beam is the plane wave w = e’**, and we can 


calculate the flux by looking at the probability current 
= h ‘ hk . 
Jin = —Im(w Vv) celta a 
m m 


This is indeed along the z-direction, and therefore 


dN = ._ fk 
dtdA 1" 7~ m 


Now for the numerator, we want to calculate the flux of outgoing particles by looking at the scattered part w, = 
kr 


fx (0, d) x , and this time we have the probability current 


- h . 
Jout = —|Im(~eVwps). 
m 


Using the fact that V = oP 1 26 5 a in spherical coordinates, we get 


ikelkr elkr oe 1 Of, elkr a 1 Of, elkr x 
r r2 


Vus(r) = (8, ) ( roe r ' rsind Od r 


Since we are interested in this behavior far away from the scattering, the order A terms all be dropped, and only the 
first term remains. Thus this is approximately (8, o) Kee, and plugging back in gives us 
> _ hk|h&(9, 6)/? . 
Jout ore — eS 

m r 


This means that 


dN a. , _ fk 
dt = joi 0A = Jagr fdr = —|f,(8, $)|? dQ, 
m 


and thus the numerator is MEV A(8, ¢)|*. Putting everything together yields 


do = |f, (0, )|?'dQ |. 
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As expected, this number is proportional to the solid angle, and thus we commonly use the notation 
do 
—~ = |fk(8, o)|* 
= = \fi(0. 0) 


even though this is a ratio of differentials rather than a derivative. So the quantity that is measure by experimentalists 
is the scattering amplitude that we can find by solving the time-independent Schrodinger equation! And if we want to 


find the total number of particles scattered in any direction, normalized by the flux of the incoming particles, we get 


o= | do = | \K(6.9))200. 


So everything goes back to solving the Schrodinger equation. First, we assume that our potential V(r) = V(r) has 
spherical symmetry (this does not mean that the differential cross-section is uniform, because our incoming particle 


beam breaks the symmetry). 


Example 188 


We will first review how to solve the three-dimensional free particle Schrodinger equation in spherical coordinates 


(meaning that V = 0). 


In general, our solutions will have an energy E = ae and they will be of the form 
_ Ue o(r 
WF) = Feely, (8,0), 


where Ug, solves the one-dimensional Schrodinger equation 


fh? d? hk? 
( on ae Ven Veg = aa Ue 2(r) 


for the effective potential 
h? £(2+1) 


Venl) =V() + 


Since V = 0 in this case, our equation simplifies to (dividing through by a constant) 


( d= &(@+1) 


dr2 r2 


) Uee(r) = k*Ue g(r). 


If we introduce the dimensionless p = k-r, then our equation becomes 


2 
(-s + ac >) Ui Ueroy 


Notice that the energy is not quantized for the free particle, which is why it has dropped out of this equation! And 


the solutions to this last equation are known and given by 


Ue.e(P) = Aeple(e) + Beene(p). 


Here, je(p) is the spherical Bessel function 


1 ate 


io) = (tet (2) 
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and ng(p) is the spherical Neumann function 


1d ) cos p 


=e € s\s 


It turns out that je(p) is not singular as p — O, but ne(p) is singular. And we also have the asymptotics pjp(p) > 


an en 


sin (o — 4) and pne(p) + — cos (p — $F) as p > oo. 
Since we also know that plane waves are solutions to the free particle, that must mean a solution like e 


'kZ can be 


written as a linear superposition of the spherical solutions: 


elkz = elkr cos @ = » Cem Ful Vp m(8, ¢). 
m 


Combining the coefficients in our solution for Ug. with the cym, we can write this as 


= So (Aemie(kr) + Bemne(kr))Yom(8, 4). 
£m 
But because the left-hand side has no ¢-dependence, we must have Ay m = Bem = 0 for m 4 0 (because Yo m(0, d) x 
elm?) In addition, because the left-hand side is finite at the origin, but the Neumann functions blow up at the origin, 
we must only have nonzero A coefficients. From here, we can in fact find the coefficients and get the Rayleigh 


formula 


el = Van) V20+ 1i*%0(8)ie(Kr) |. 


£=0 


In particular, all angular momentum values @ enter in the superposition! And because we know that as r + oo, 


1 ln 1 fies) ers 
Pe eg 7 
sc aa sin ( e ) kr ( 21 


we can plug everything back in to find that 


Van = i fel@-2) eka) 
eX? ws ——_ S/28 + 1% 0 (0) — 
k > 2.0 >) r r 

é=0 
So we have a sum over angular harmonics, and within each angular harmonic £, we have a spherical wave going outward 
and a spherical wave going inward! And next time, we'll learn how to extend this analysis to the case where we have 


a finite potential V(7). 
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We'll discuss scattering today, thinking about how we can transition from wave packets into the time-independent 
Schrodinger equation and also looking a bit at one-dimensional scattering. (In lecture tomorrow, we'll get a bit more 
into the main techniques that we have for solving scattering problems.) 

Recall that the setup of three-dimensional scattering is that we have a potential V(X) (which is usually localized 
so that it is only nonzero in a bounded region) which we scatter a beam of particles off, and in particular we often 


measure the number of particles scattered to a given solid angle 


d2 Nyscattered : d2 Nincident do 
dtdQ dtdA dQ 
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In an experiment, the left-hand side is something we count with a detector by looking at some dQ = sin@déd¢, 
and the first fraction on the right-hand side is the flux that we're sending into the system. So aoe the differential 
cross-section, is the relevant quantity to predict using quantum mechanics. 

By convention, we send in our beam of particles traveling in the +z direction which we can think of as a wave 
packet, and we can imagine that it produces an outgoing wavepacket in the +z direction, plus a spherically outgoing 
wave. But we can connect this to the energy eigenstate description as follows: suppose that the size of our incoming 
packet is Az, and suppose the spatial range of our potential is R (this was a in lecture). The limit that we're solving 
this problem in is actually 

Az>R, 


and we'll think about this in terms of wavenumbers: for our wavepacket, Ak ~ x“ < z, so we're saying that Ak (the 
spread of the wavenumbers) is small, and thus our setup is close to having a well-defined momentum (which is what 
would be the case if Ak = 0). And as Ak — 0, we can indeed reduce our setup to an energy eigenstate problem: the 


incoming plane wave has some definite energy E, so that 


ikz—iEt/h 
Vin =e / , 


and the outgoing wave can then be parameterized as 


: ; ikr 
Pour = e7HEt/A Gi 4 Atogy? ) 


r 


for r >> R, because we want to consider the outgoing wave far away from the potential so that we essentially have a 


free particle. So we have Wout = Win + Wscattering, and we also know that the energy (for a free particle) must have 


_ 1k 
E= an 


Example 189 


Let’s now think about what's actually happening as we take Ak — O and understand why we can use the time- 


independent Schrodinger equation. 


Note that we're losing some causality if we write down the incoming and outgoing waves as above, since both 
waves exist at all times. To understand why this is okay, consider a wave packet solution where the distribution of 
wavenumbers k is given by some g(k) centered at some k® and with some width Ak. Our new w for the wave packet 


is now a superposition 


Woacket(k°, Ak) = f dka(kvrine(), 


and our outgoing wave Is now similarly 
“ eikr 
Wout = i; dkg(k) (o* + ~ fi) eT lEKt/h 
0 


We're taking the limit where this wavepacket has small Ak: recall that the group velocity of our wave packet is 


Vg = ak so the wavepacket crosses the region where the potential V is nonzero in time 
Az 
At ~ — 
Vg 


(remember that Az >> R, so the width of the wave packet is the relevant length that needs to be passed through the 
potential). As Ak > 0, we have At — oo, so we've basically stretched out solution out into an infinite plane wave 


which never finishes crossing the origin! 
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Similarly, we can think about timescale for the outgoing wave: suppose that the wavepacket is at the origin of the 
potential X = 0 at time t = 0. Then we want to know when the scattering wave starts — If the wavefront takes AT 
time to pass the potential, then the scattering starts at time t = ar because It’s halfway through at time t = 0. 
Since AT — oo, we see that this time Ar — —oo as Ak — 0, and that’s why we also see the scattering wave 
existing at all times. (So the time-dependence is simple — we have energy eigenstates because g(k) = 6(k — k°), and 


‘Et/h term is being affected by time.) 


now it makes sense for us to say that nothing else besides the e~ 
We'll now set up a problem in one-dimensional scattering that helps us say some things about three-dimensional 


scattering as well: 


Example 190 


Suppose we have a one-dimensional problem with an infinite barrier for all x < 0 and some localized potential 


V(x) for x > 0 so that V(x) = 0 for all x > R. 


If there was no additional potential, meaning V(x) = 0 for any positive x, we'd be scattering particles off of a wall, 
so our solutions would need to vanish at the origin and we'd have solutions of the form 
; elkx on e7tkx 
Wo(x) = sin(kx) = ————_., 
2! 
where we can think of e~'** as the incident wave and e’** as the reflected wave (together forming a standing wave). 
So for all x > R, our wavefunction now takes the form 
ikx+2i6«. __ eT ikx 
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e 


wy = = e! sin(kx + dx) 


for some real phase shift 6x depending on V(x), since we need |Win|? = |Wout|? by probability conservation. So we 


want to understand how V has affected the scattering, which we can see by calculating the scattering wavefunction 
Ws = by — Wo = e™ (7 — 1), 


which we define as e/**f, for the scattering amplitude f, (noticing that the e’** term means that everything is 


outgoing here). The probability of scattering due to the potential is then 


lWs|? = | fil? = sin? dx. 


And it turns out that we'll see an analog of this in three dimensions as well, remembering that we also have x = 


\f.(@, @)|* from lecture. It turns out that (as we'll see soon) the total cross-section 


can be written as a sum over partial waves 
4m : 
= BS pa (2 + 1) sin? Og, 
£ 


where we have a different phase shift for each partial wave. For now, though, we'll look at a particular example where 
the analogy is clear without needing to sum over partial waves: basically, choosing a particular £ to dominate gives 
us a one-dimensional problem. Specifically, if we have low-energy scattering so that k — 0, the s wave (that is, the 


£=0 wave) dominates. If we then imagine a central position V = V(r) bound in some region r < R, then recall that 
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we can write the Schrodinger equation in the form 


Ue e(r) 
r 


W(X) = Y¢,0(9), 


where there's no @-dependence because we take m= 0. We then get a differential equation 


( d? f+) 


dp 2 ) Uee(r) = kU g(r). 


But taking 2 = 0 and k — O, we Just have ~ Ue 0(r) = 0, meaning that U is a linear function of r which we can 
write as c(r — a). And we can be more precise by solving the Schrodinger equation with a finite k, so that we have a 
simple harmonic oscillator 

Uo(r) = sin(kr + do(k)) 


for the S-wave phase shift 69(k) (which is much like the 6, phase shift we observed above). As k > 0, we have 


in Uo) = Jin (k (r+ 9) ) =a (rt in 2) 


And now we can match up coefficients with the c(r — a) solution we had above — the most important feature is that 


we have the scattering length 


which tells us about how scattering behaves at low energies. In particular, sin dg behaves like —ka, so the cross-section 
will be 


Oa sin? 5p & 47a°. 


In other words, the unit-length quantity determines our cross-section, and it has a lot of physical significance for our 


An 
2 
wavefunction as well. 


44 May 11, 2021 


Last lecture, we started discussing scattering: we scatter particles off of a potential V(r), and we calculate how many 
particles hit each patch dQ of the solid angle. We then discussed that we would like to calculate the differential cross- 
section a which we do by solving the time-independent Schrodinger equation with energy E = _ (depending on 
the incoming beam), subject to the boundary condition 


elk-r 


UF) el + (0,6) — 


as r —> oo, where we need to solve for f, in the process. This then gives us the differential cross-section ae = |f,(0, 6)|?, 
and we can then get the total cross-section by integrating 0 = fangs. 

From there, the next step is to solve the three-dimensional Schrodinger equation, and we're doing so in the special 
case where VF) is spherically symmetric. Then we can look for solutions that are products of a radial and a spherical 
wavefunction, and we then reduce to the one-dimensional problem with an additional centrifugal barrier term. 

Last time, we noticed that in the spatial region where V = 0 (and we have a free particle), we have solutions given 


by the spherical Bessel and Neumann functions. Since we also know plane waves solve the Schrodinger equation in 
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V =0, plane waves must be a linear superposition of those functions, and in fact it turns out 
co 
el = Van S28 + 1i*%0(8)ie(kr), 
£=0 


and only the m = 0 terms contribute. If we then take the limit r — oo, we indeed see the incoming and outgoing 


spherical wave in the partial wave expansion 


el(kr—en/2) eWi(kr—£r/2) 


Van = ¢ 1 
= aoae » V20 + 1i Y40(8) 5: | 


r r 


In particular, in each angular momentum channel 2, the incoming and outgoing components have the same amplitude. 


So now we want to attack the problem in the region where V(r) 4 0. We again look for solutions of the form 


U 
y(7) = EMD, 0,6), 
as hed? f* e(@+1) hk? 
( 2m dr2 | 2m fr , vi) Vee(r) = Om Veal). 


(So we have an effective potential going as 4 from the centrifugal barrier, plus the localized V(r).) We also require 
that U(r = 0) = 0 (because the wavefunction should be finite at the origin, and it contains a utr) term), and we still 
have a one-dimensional free particle as r — co because both terms of the effective potential go to 0. So our solutions 
will look like 

U(r) = Fee’*” + Gee'*" 


as r — oo, in certain very particular linear combinations so that they also satisfy U(r = 0) = 0. Though we can't find 
the exact values of Fz, Gz without solving the Schrodinger equation, we can still say that |Fe|* = |G,|*. This is because 
we can imagine sending in particles with an amplitude Gy into the potential, , and then we get back particles with an 
amplitude of Fz, and there is no transition probability (because we can't have r negative, so the origin can be thought 
of as a hard wall). So the probability of reflection is 1, and that indeed gives us |Gg| = | Fy]. 


Therefore, Fe and Gg must differ by only a phase, and we'll parameterize this by setting 


U2(r) x al (kr—ém/2+260) = e7 (kre /2) 


as r —> oo, so that the phase is characterize by 26, and e7/% = (-1)2. (Here, 6, is a phase shift defined modulo 
mW, and it's different for different spherical harmonics 2. Note that we choose it in this particular manner so that 6g = 0 
when V = 0, as discussed earlier.) It will turn out that knowing all of these phase shifts 6, is enough to determine 
f,(0, @) and thus obtain our scattering cross-section. 

Returning to the Schrodinger equation, our general solution takes the form 


Ue.e(r) 
r 


wr) = ye Cem 


£m 


Yem(9, 6), 


and we're interested in those solutions that satisfy a particular boundary condition. So we'll match to that boundary 


condition above: because we've just parameterized our solutions in terms of the phase shifts 6,, we find that 


roo. 


w(r) ah Cem 


£m 


a 22 e—i(kr—£n/2) 


) Yam, 8), 


r 
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and we want to match this to our boundary conditions 


W(F) > eX + (8, d) 


elkr 
ee 


Everything here is a spherical wave except the plane wave term e!*7, and we use the Rayleigh formula now (writing it 


in terms of spherical harmonics) to get 


+ (8, @) 


Var ed 1 ellkr—er/2) e—i(kr—£r/2) elkr 
=——) V20+ 1% o= . 
k ys ae if r 
£=0 
So the first has an outgoing and an incoming term, and the second term has just an outgoing term. We now match 
our two desired solution forms for w(7) to find our unknown coefficients cgm — since the incoming wave only comes 


from the plane wave e/*? term and we know that expansion already, we have Cy = 0 for all m 4 0, and 


V4 1 
Qo = Vv 2¢ 4 tae 


Therefore, we find 


‘ Y2,0(8). 


_ V4 41 
ee 2+ 1i = ( 


Now that the incoming part of the solution and the boundary conditions are satisfied, we can now match the outgoing 


ellkr—er/2+262) e—i(kr—£r/2) ) 


parts to determine an expression for f,(@, 6). We find that (because outgoing wave has two pieces, from the scattered 


wave and the plane wave) 


Jin & af eikr Jar = oS ee 
9 1 ess i(—€m/2+20e)y, a) = 9 1 bs —ikn/2v, a) £,(0 ; 
(4 yy 2+ 11 ae 2,0(8) ae =p me 2+ 1i ae 2,0(8) + f.(8, d) ; 


en factors and moving both sums to one side, we get 


Canceling the 


co 


3 


qi t= 
f,.(0,) = 20+ ee (e7/% — 1) Ypo(8). 


k 


Ss 


=0 


And now because e~2/? = (—/), this cancels with the i%. And we can also manipulate the exponential terms to get 


a sine out of the difference of exponentials: 


(8,6) = = S¢ V20 + Te’ sin(5e)¥2,0(8) |. 
£=0 


So we've expressed our scattering amplitude solely in terms of the phase shift — in fact, each phase shift tells us 
the coefficient of the corresponding spherical harmonic Yeo. (And in particular, the scattering is @-independent, 
which makes sense because our system is symmetric around the z-axis. So the differential cross-section will also be 


g-independent.) This leads us to the total cross-section 


a= | Zar | arson, 


and inserting our form of *{ into this equation gives us 


4 = — 
= 3 dQS © V20 + 1V 20 + Lee" sin(de) sin(5e )¥4'o(8)Ye,0(8). 
La! 


But we know that our spherical harmonics are orthonormal, so [ AQY55(9) Yo0(8) = ee and our double sum collapses 
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into a single sum: 


c= aa S (28+ 1) sin? dy |. 
£=0 


So the total cross-section is the sum of the partial cross-sections 
4 ; 
O.= pe (28 + 1)sin?(6,) 
over all 2. And because é, is a real phase, we can make the partial wave unitarity bound 
4 


So conservation of probability (which was what gave us the reality of 6) basically tells us that there’s only so much 
scattering that can happen, which is a useful consistency check in particle physics. 


We may notice that our expression for o and for f, look very similar, and it turns out that we always have 
4a 
= Zh =0)). 


In other words, the scattering amplitude that is basically not deflected from the z-axis tells us about the integral of 
scattering over all angles! This is known as the optical theorem, because the scattered part of the outgoing spherical 


wave and the outgoing plane wave give us an interference pattern. Indeed, it turns out that 


Yeo(8 = 0) = \/ St * = im(4,(@ =0)) = 


and the imaginary part of e’® is sin dp, so we indeed get our sin? dg showing up like in og. 


lr 


ee) + 1)Im(e!) sin dg, 
£=0 


What's left for us to do is to calculate the phase shifts 6g, and those come from solving the effective one-dimensional 


Schrodinger equation for each angular momentum channel £. 


Example 191 (Hard sphere potential) 


Suppose we have the potential 


r<a 


otherwise. 


We can think of the scattering process as bouncing particles off a completely inpenetrable wall. Our effective 


potential is then 


h? £(2+1) 0° yaa 
Me) = ome TMOO= ) very | 
am 2 otherwise. 


We then need to solve the one-dimensional Schrodinger equation to get U,(r), and our boundary condition is now that 
U,(a) = 0. But we have a free particle with centrifugal barrier for all r > a, and we've already found that the solutions 


are the spherical Bessel and Neumann functions: 
Ue(r) = r (Aue(kr) + Beng(kr)) . 


To impose the boundary condition Ug(a) = 0, we must have 


Be _ _ Je(ka) 
Ag me(ka)' 
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and since we care about 62, what matters to us is the asymptotic form of the solution as r — oo. This requires using 


the asymptotic behavior of the spherical Bessel and Neumann functions, and we find that 


U(r) a : (4 sin («r — =) — By,cos («r _ =)) : 


which we need to match against 


U(r) teres el(kr em /2+25,) __ e i(kr en /2)_ 


Pulling out an e’® in the latter expression shows us that it is proportional to a sine term: 


Ln 


Up(r) x sin («cr — m + i) = sin (« — =) cos(dg) + cos (« - =) sin(dg). 


So we must have a ratio of coefficients given by 


Ae Be 


- _ Je(ka) 
cos dg sin Og 


tan dg = PCDI 


So for each k, we can compute the phase shifts 6, and that gives us the total cross-section 
An ae 
ars So (26 +1) sin* dg. 
£=0 


tan? x 


T+tantx: We have 


In particular, because sin? x = 


An Je( ka)? 
= — 2é+1)- : 
POS ie yi Vilkay? + nel kay? 


This example turns out to be very illustrative — if we look at scattering in the low-energy limit k — 0, we'll get behavior 
that is generic for short-range potentials. In this case, as ka + O, the spherical Bessel and Neumann functions have 


known asymptotics: 


£ 
Je(x) > Ea n(x) ee" 
where the double factorial misses every other term as it counts down (so 5!! = 5-3-1). So plugging in these forms 
into our phase shift yields 
tan 6, = : (kayo te i, 


(22+ 1)!1(2@—1)!! 
So then we can approximate tan x ~ x and find that 


(kaye? 


o¢*— OFF Te 1) 


So phase shifts become progressively smaller for large és, and the largest phase shift comes from the 2 = 0 channel. 


The scattering amplitude is then 
Van = ss 
,(0,o) = — S28 + e® sin(52)¥2,0(8), 
£=0 


and the dominant term comes from £ = 0 because we have a sind, © 6, term — that evaluates to —ka. This yields 


fu(8, 6) © Vr (_Ka)Yo0(6) - ve ka) = 


So in the low-energy limit, the scattering amplitude becomes angle-independent (uniform in any direction), so the 
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differential cross-section is just a2, and the total cross-section o = f s dQ = Ara? is the surface area of the hard 


2 


sphere! (Classically, the cross-section of the hard sphere is ma*, so the quantum case gives us a factor of 4.) We'll 


talk more about this angle independence next time. 


45 May 12, 2021 (Recitation) 


Last recitation, we made an analogy of three-dimensional scattering with one-dimensional scattering by thinking about 
having one partial wave dominate our sum over @ (and indeed, this is the case when we have low energies and thus 
low k, as we saw in lecture). One point is that as long as V is zero outside of a localized range, we can always define 
a scattering length a. 

Today, we'll continue that discussion — first, we'll go to higher orders instead of Just keeping the leading-order 
contribution. We often get combinations that look like e/ sin dg (where the 0 subscript indicates that we're looking 


at the 2= 0 partial wave), and to motivate how we do our Taylor expansion, we'll rewrite this as 


_ sind sin do 1 


e-'% ~ cosdg—isindg  cotdg —i 


So it turns out that we should expand in this parameter cot dg for small k: we have, as k > 0, 
oe 4 
k cotdo ¥ —— + Stok + O(k*) 


(recall that a is defined as the limit limz +o po), This is called the effective range expansion, and if we do this for 


higher partial waves d¢ as well, we'll get similar-looking expansions. 


Example 192 


We'll now connect this discussion to that of the hard sphere, where our radial potential is co for all r < b and O 


otherwise. 


We must have w(r = b) = 0, which gives us a condition on our wavefunction 
Up ~ sin(kr + 60). 


This gives us a condition on our phase 69 = —kb, and we'll notice that the scattering length a is just the radius of 
the ball b. If we draw the wavefunction Up(r), we have a horizontally shifted sine function which vanishes at r = b. 
(In words, we can say that a repulsion from the potential V gives us a negative phase shift 69, which pushes the 
wavefunction outward from the origin.) 

If we focus in on low-energy scattering and take k — O, our sine function also becomes more linear. And to 
illustrate what's going on with the scattering length, if we replace our infinite well with a finite but positive well, the 
scattering length a will satisfy 0 < a < b (it won't push the wavefunction outward as much, but it will still have some 
effect). Remember that because our sine function will look linear near the x-axis (Up © c(r — a)), the interpretation 
of a in this case is the intercept of the linear part of the wavefunction where V = 0, and that will now be before 
r = b. (And similarly, an attractive finite square well will pull the wavefunction inward, making the phase shift positive 
and thus making a negative.) 

If we keep making this square well V = —Vo deeper and deeper, our scattering length a will get more and more 
negative, and it turns out that a > —oo at a finite value of Vo, where the wavefunction in the outside region becomes 


flat. A (normalizable) bound state will enter the problem with k = 0 at this infinite scattering length — if we then 
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further decrease Vo, our bound state’s energy also decreases, and our wavefunction in the outside region has phase 
shifted so much that the value of a becomes positive again (since the slope of the outside region is now negative, 
rather than positive). 

Continuing to make Vo small, at a certain point, we get an intercept of a = 0: remembering that the cross-section 
should be 47a for low-energy scattering, this tells us that there is a suppression of low-energy scattering for certain 
potentials Vo. And then finally, when we reach a phase where a = —oo again, we will get another bound state appearing 
at the threshold k = 0. We can understand the bound states by citing the solutions to the finite square well, which 


satisfy 


1 1 
k’ cot(k’b) = ~_V—2mE, = AV 2mE +W). 


So if we want to understand the special case where the states show up at k = 0 = > E = 0, we have the condition 


1 
that cot(k’b) = 0, and thus we must have b = ae 


We'll now turn to a discussion of partial waves: recall that we can approximate our solution for r away from the 


for some integer n. 


potential V as 
: elkr 
(7) = e + (8) 


pt 
where 


Lee 
f(8) = 5 S| e!® sin(5¢)(V4aV 20 + 1Y)0(8)) 
£=0 
and the differential cross-section is just ae = |f,(9)|?; integrating over solid angles and using orthogonality gives us 
4n 9 
os S 9 (26 + 1)sin*(62). 
£ 


So the main task we have is to calculate the phase shifts 6g — we'll talk more about this in lecture, but the idea is to 


write our wavefunction as a product of radial and angular dependence 
v= S—Re()Y%0(8). 
£ 
and we'll solve this for each 2 separately. If V is zero at some r > R, then we know that our solution is 
Re(r) = Auie(kr) + Bene(kr) = AgUe(kr) — tan dene(kr)), 


and we calculate 6g by solving the Schrodinger equation in the region where V is nonzero, in particular decomposing 
into radial and angular parts again. We can then match both the radial wavefunction Re(r) and its derivative Rj(r) at 


r =R, and that will give us a formula for 6g. 


Example 193 


Suppose that we have a potential of the form 


oe r<b 


VINS KV Deir <e 


0 aeaGe 


For simplicity, we'll just try to compute the 2 = 0 phase shift d9(k). 


In the region b < r < c, we know that we must have sine-type solutions with wavenumber given by k? = oP (E+Vo), 


but we need to pick the sine function so that it vanishes at r = b because of the infinite well on the left. Therefore, 
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for b< r <c, we must have a solution of the form 


Up(r) = A sin(ka(r — p)). 


We then compare that solution to the solution in the r > c region; if we plug in the spherical Bessel and Neumann 
functions for 2 = 0, we find that for r > c, 
€ 


Up(r) = k 


sin(kr + 60). 


We must match up these wavefunctions and their derivatives (or we can equivalently match up Ro = Yor), and one 


trick to get simple calculations is to ask for ue to be continuous (since this will get rid of the coefficients in the 


front) at r= c. We find that 


ky cos(ky(c — b)) _ kcos(kc + 60) 
sin(ki(c—b)) ___ sin(kc + 69) 


meaning that 
ky cot(ky(c — b)) = k cot(ke + 4p). 


The only thing that is unknown here is 69, so we can solve for it here, and we find that (reciprocating the above 


formula to use tans instead of cots) 


6o(k) = —kc + tan“! (F.tan(ia(c = »))) 


We can check that various limits of this expression work: if we set Vo = 0, then we get the hard sphere solution, and 
expanding in 69 will make k & k, and thus the tan~+ and tan cancel out. And if we think about the scattering length 


as k > 0, setting b =0 so that we just have a single finite well for simplicity, we find that 


J2m\K k 
k= ~ Cay => bo(k) = —ke+ reel: 


This gives us the scattering length 


1 
a=c-— —tan(yc), 
Y 


and thus a is not quite c for the finite well. Instead, it can take on various real-number values, and this connects back 
to the argument that we made with the slopes of the intercepts earlier in the recitation and justifies the values for 


bound states! 


Remark 194. /t can be shown that for £=0, we get 


ole ph tes cbc — 
fro = ze" sin(5o) = k(cotdy —i)  —4 — ik 


by taking the small-k approximation. So for small k we have |f|? ~ a, but if we send a — oo first, then we get a 
scale-invariant problem and o is proportional to & instead. So as k + 0, we do get a large cross-section, and we need 


to consider higher-order terms to understand what's really going on in the system. 


46 May 13, 2021 


We've been analyzing scattering for a central potential V that only depends on the distance to the origin — the idea 
is to decompose into different angular momentum channels, solve the one-dimensional radial equation with effective 


potential, and study the asymptotic behavior of the wavefunction for r — oo. We then defined the phase shift 62 so 


161 


that Up(r) x sin (kr — i + 6¢) as r —+ 00, so that we can find the scattering amplitude f,(@) and the differential and 
total cross-section in terms of that phase shift. 

We then applied this formalism to a special case of a hard-sphere potential, finding that in the low-energy limit 
ka<1, we have dy « —(ka)?&1, so that the £=0 channel will dominate the scattering. We then find that f,  —a 
becomes angle-independent, and the cross-section becomes 47a. A natural question to ask is whether this behavior 


is a general phenomenon, and it turns out that this is common to all short-range potentials: we will generically have 
5g x k**t1 as k > 0. 


This motivates defining the scattering length 


which can be positive or negative depending on the problem, and this leads us to the familiar 
qo 7 as o = 4raz. 

But there's an exception where something different happens, and that’s when we have bound states in the potential. 

If this bound state’s energy —E, approaches the continuum threshold, meaning that E, — 0, then our scaling behavior 

will break down, our scattering length will go to oo, and the phase shift will go to 5 (so that sindg becomes maximal in 

magnitude). (This has applications to cold-atom physics and Feshbach resonance, where the tuning of our potential 


comes from an external magnitude field.) 


Example 195 


We'll now explore the reason that we expect 69 « k generically, leading us to & being angle-independent. 


Recall that the only breaking of spherical asymmetry in the problem comes from the direction that the incoming 
particles are coming from, but as k — 0, the length of the wavevector goes to zero, so we lose the sense of 
directionality. Another way to say this is that as k > 0, we have a very large wavelength for our incoming wave, which 
will be approximately constant over the size of the short-range potential. This means that the plane wave will appear 
less biased in one particular direction. 

From here, we'll take a step back and try to develop methods that work when we don’t just have central potentials. 
(But we'll still assume that V + 0 as r — 00.) We're going to need to do perturbation theory in the potential V — 
this is called the Born approximation — after all, even in the spherically symmetric case, there's still quite a bit of 
work in solving the one-dimensional Schrodinger equation. 

Let's start by being exact. We now need to solve the equation 

2 2 4-2 
(-Evva)w@=e0m, 6-5 


2m 


We'll rescale our potential as V(F) = F u(r), so that we need to solve the equation 


(-V? + UWA =O = [(V?2 +E WH = UHV) 


We'll now turn this differential equation into an integral equation as follows: suppose we have a Green’s function for 


the operator (V2 + k*), meaning that we have a function G(7) satisfying 


(V? + k*)G(7) = F(7), 
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or equivalently (V2+k?)G(F— 7) = 63(F—7). Then this Green's function can be used to turn our differential equation 


into 


vn = Yo(F) + / PAG(F-7?)UP (7) | 


where Wo(F) satisfies the homogeneous equation 


(V? + k*)Wo(7) = 0. 


Proposition 196 


The two boxed equations above (differential and integral) are equivalent. 


Proof. Suppose that 7(7) satisfies the integral equation. Then applying the operator (V2 + k?) on both sides, we get 
(Vie + k?)y(F) =0+ | a7 (V2+ k?)G(F— P)U(P)W(F), 


where we've moved the operator inside the integral (convergence issues are not a problem as long as our short-term 


potential is well-behaved and decaying). Now by the definition of the Green's function, we get that this is 


= / PH5(F — PUPP) = UPO(A), 


which is the differential equation that we want. On the other hand, if we start with the differential equation, we can 


just define 


bo(F) = 47) - if PAG(F— PUP O(?), 


and we can verify that (V* + k*)wo(7) = 0 by doing the exact same manipulations. Then the integral equation is 


clearly satisfied by our choice of Wo. 


It turns out that there are two different Green's functions that work for us in this case: we claim that they are the 


incoming and outgoing waves 


1 etikr 
G = —— 
+(7) 4n or 
where r = |r|. We can verify that this works: first, we check that for all 74 0, we have (V? + k?)Gi(7) = 0. By the 
Laplacian formula in polar coordinates V* = -32(r 2), where we're only using the radial part, we have (just doing 


the + case for concreteness) 


; hb alkt 
V°6,(7) = aoe (*( 1 ike 5 tke i 


4n r? Or 
which simplifies to 


110 . 
_ - 3 a elkr ikre'*") = 


11 
ee 


ikelkr ikelkr k? elk") = elke. 


which is indeed —k*G,(7). (Taking a complex conjugate on both sides gives us what we want for G_, as well.) And 


then for 7 = 0, we notice that G,(7’) -> — z= because the phase factor goes to 1, and indeed V? (—z-) 6°(7) (we 
may remember this from electrostatics). 
So now that we've verified our Green's function works, we can plug this into our integral equation. And knowing 


which Green's function to use, as well as which wo(7), can be found by looking at the boundary conditions we're 
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imposing as r — oo, which are 
, elkr 
w(7) = el? + (8, $)—. 


So it makes sense to make Wo(F) the plane wave e’*7, and it makes sense to use G;(F), because our integral equation 


then takes the form 
ap(7) = efk2 + / a7G,(F—F)U(P)Yp(P). 


We need to check that the second term here is indeed a spherically outgoing wave so that our boundary conditions 


ik\F—7*| 
r) = 1 (cals > 1 
4n |r-?| 


effectively think of our integral as getting significant contributions from a finite region for 7’ near the origin. And as Fr 


are satisfied — indeed, because G1(7 and the potential U goes to 0 rapidly as r + oo, we can 


gets very far away from the origin, we can expand the denominator as 


: Fo 
7-7 = VaR R= Vor Pa rf 25 


Since r > r’ here, we'll approximate this square root with the binomial expansion as 


r 7 
wr(a- 5-7) =r rr 
2 


And we can actually simplify our Green’s function form now: the denominator can be approximated as r, and the 
numerator we have to be a bit more careful with because things are coming in as a phase. Basically, as r > oo, we 
have 

Giilr= 7’) ~ 1 elkr go ikhr 
so that the integral equation as r — co becomes 


elkr 


pe 


In other words, all of the #dependence has been taken out of the integral, except through 7, which tells us about the 
angle (6, @) of our vector. So this fits our desired boundary condition e'*? + (0, og) as desired! 

We now have an expression for the scattering amplitude, but it’s misleading, because it still depends on the 
wavefunction w(7) over the range of the potential. This means we can’t get away from solving for w, and here's 
where we'll do our perturbative expansion. We'll introduce some notation first: let kj = k- 2 be the wavevector of the 


incident beam, so that we can rewrite our integral equation as 
W(P) = hr 4 / PPG, (F—F)U(P)Y(P). 
We'll iterate this equation by plugging in w(7’) back in for itself: 
WP) = el 4 / P7G.(r- P)U(r) Ga - / dPr'Gy (7 — nue we)) ; 
so that we can split the contributions into a term with one integral and another with two integrals: 
w(r) = sere | P7G.(7- P)U(P) ele" + f Pre,(7- rue) f Pec. (7 =F Ur tr). 


This iteration procedure can be repeated by substituting in an expression for w(7”), so we have a term with a double 
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integral and another with a triple integral: if we keep iterating this, we'll end up with an expression like 


WA) = eltre | P7G,F- Ue + f P76, (7 Pur) f Pra, — PUP) eR 


; / PPG. (F— FUP) i. PPG, (F — FUP") / PONG. (P — PUR eT 4... 


In other words, every successive term has an extra integral factor, and each one has a phase term for its corre- 
sponding variable. What we're really doing here is doing a perturbative expansion in U (or equivalently V), but let’s 


first do the first Born approximation, where we just keep the first integral term. Then we have 
WF) x eh F 4 / d7G,(F— PU(A)". 


Then as r > oo, we use our expansion of the Green’s function from earlier, and we find that 


yy 2 ors (2 f Prem ruryer) 
4a ro 


We can then read off the scattering amplitude 
1 se ea y 
fe(.6) = — 5 faerumyeen 


If we then let ke = k-f be the momentum of the scattered particle, we can define K= ke _ kj to be the momentum 


transfer between the incident and scattered particle, and our scattering amplitude then becomes 


1 i. 
f(0, 6) = —a f PreRr U(7’)|, 


to first-order in U. (In words, this is very simple: to get the scattering amplitude, we just need to take the Fourier 
transform of U, with wavevector given by the momentum transfer! ) 

Looking more at the geometry of this scattering process, we should remember that the scattered momentum is 
the same length as the incident momentum, but it points an angle @ away from the z-axis. The length of k can then 


be obtained with the law of cosines: 
= 5 = Sy 7 
K? = k2 + k? — 2k, - kj = 2k? — 2k? cos @ = 4k? sin? z 


In other words, the length of K will be 2ksin g. And if we now think about the special case where we have a central 
potential, U(7’) only depends on the length of 7’, and we integrate over all 7 so the integral for *{(@, @) only depends 
on the length of |K|. Since |K| only depends on @ and not @, this is consistent with what we've previously found. In 


such a case, the calculation simplifies dramatically: if we choose our new z-axis along the k-direction, we find that 
Fr =r'(sin©cos®, sin Osin ®, cos©), 


and then we have , 7 - 
fy = -z | ar | sin ede | debe IKrcosO u(r’), 
4m 0 0 
The ® integral gives us a factor of 27, and if we let w = cos@, we can evaluate the integral to be 


—iKr'w 1 


i ae Ye, iid A 5% e 
=-3f¢ ar [due : mute) =—5 fr dr'U(r') IR la! 
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simplifying to , 
ik = -z/ dr’ U(r) sin(Kr'). 
K Jo 


And because U(r) = 2eV(r), we get our final expression for the scattering amplitude in the spherically symmetric 


case: 
2m f° 


f.(0) = aK I, dr'r’'V(r')sin(Kr’), 


where K = ksin g. We'll talk more about the higher-order terms and validity of the Born approximation next time! 


47 May 17, 2021 (Recitation) 


We'll discuss the Born approximation for scattering today, doing a review and then applying it to some examples. 
Recall that we’re trying to solve the scattering problem, which is the Schrodinger equation with wavefunctions 
suitable for scattering: 


(V? + A) w(7) = UFV(F), 


where U(F) is the rescaled potential U = 2OV and E = ue To do this, we introduced the Green’s functions, which 


allow us to rewrite this differential equation as an integral equation 
vA) = wl) + f Pree MUFF), 


where (V2 + k*)qo(7) = 0 and (V2 + k?)G(F— 7) = 639(F— 7). To make this specific to our scattering setup, we 


used the Wo = e'*? to set up the incoming wave, and we used G(7) = —{+e/K" 


ae as an outgoing spherical wave. And 


the Born approximation comes in when we try to solve the equation: since w appears on both sides of the equation, 
we can substitute it back in for itself above in an iterative way. If we assume that w wo, we then get the first-order 
approximation 


W(F) = bo(F) + i PA G(F—P)U(F)Wo(?), 


Since we're interested in the scattering at large r, and we know the potential is localized, we can write that there is 
some b such that U(? > b) = 0. Then for r > r’ ~ b when the integrand is nonzero, we can expand the Green’s 
function to get 


elkr 
r 


W(7) =o + —A(6, O) 4°, 


where 


1 ae 
fF (6,6) = — = f erent Ur): 


In other words, to find the (first-order approximation) of the scattering amplitude, we're doing a Fourier transform of 


our rescaled potential! Here, recall that q is the momentum transfer (denoted K in lecture) 
G = kr — kz, 


where we can think of a particle coming in with momentum kzZ and scattering out with momentum kf. (So the 
direction of ? is the reason that this expression for AO) depends on the angles 0, ¢.) And doing some algebra, we found 
that the magnitude of the momentum transfer g = |qg| = 2ksin g. 

One thing we haven't discussed yet, though, is when this Born approximation is good. Essentially, we want to 


make sure that w is close to Wo, and we can do that by making the first-order integral term small. One way to do this 
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is to ensure that ' 


fi 
Vi<—, 
so that the potential is small, and another way is to look at the case where the momentum k is very large (since this 
makes the “free energy” term wo larger and larger): 


hi 


VI < (kb). 


Basically, when k gets large enough, the integral oscillates quickly and thus doesn’t give a large contribution relative 
to Wo. So because the Born approximation looks at large k instead of small k (like for partial waves), the two methods 
of solving scattering problems are complementary! 

If we now look at the special case where V is a central potential (as in lecture), we can reduce the form of the 


solution to 


a : 
FY (6) = =, dr’ (Gave) sin(qr’) |. 


We can notice that the whole function here is now just dependent on gq = 2k sin § (not on k and @ independently), 


which is different from the approach with partial waves where k and @ appeared separately. And if we now turn to the 
differential cross-section j 
ro) 
—— = (fF (6)/? 
<= |F D6), 


we'll notice that the sign of the potential drops out, because replacing V with —V just adds an overall sign to fF). 


Remark 197. Note that f) (8) is always real, but by the optical theorem, the imaginary part of #0) (which is 


zero) should be related to the total scattering cross-section, which seems odd. And the resolution here is that 
a= f aire? 


is already second-order in V, so we need the second-order Born approximation to get that this is 41 im) (0). 


Finally, we'll probe a few particular limits. If we now look at the regime where kb < 1, and we have a small 
potential |V|, this means that we're looking at the limit q > 0. Then we find that sin(qr’) and the 7 in our integral for 


A) cancel out, and the phase goes away. So indeed, i? is an integral over the potential without any g-dependence: 


1 2m 


fFY(6,9) = ae rae eel 


so the angular dependence drops out as expected. And if we look at the regime where we have large k, we'll make q 
large and make sin(qr’) oscillate more quickly. So the bulk of the contribution to the integral comes from the parts 


where q < i and that gives us a formula for k and @: 


In other words, we get a contribution from the range of angles where @ is close to 0 (of order a) meaning that little 


deflection happens. 


Example 198 


Let's now look at some particular central potentials V and see how they affect the boxed expression for f°) (8) 


2 
above: first, we'll analyze the Gaussian V(r) = Voe 22. 
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In this case, we find that 

2m fa ingae) -3y 
— rrsin(qrje 26 
hq 5 qd 0, 


and since we know that the Fourier transform of a Gaussian is a Gaussian, we can find that the answer for the amplitude 


F(@) =— 


becomes 


_ = fg? mV b? ange 
Taal ‘ 


(Notice that we're assuming b is small enough so that the mvs b? term is small, which is where the Born approximation 


comes into the picture.) We then have 


da _ mb? (mVob?\* _ ap 
dQ 4 ( Fe ) co 
Then the differential cross-section for a fixed @ as a function of k will be (the right half of a) Gaussian: by the time k 
is of order é it will die off. In other words, once we get past the size of the potential, we don’t see any cross-section, 
so we can get some fundamental information about the potential by scattering particles off of it. 

On the other hand, if we fix k and look at the differential cross-section as a function of 6, the answer will depend 
on how large k is: if k is very small, then q will be small regardless of @ and thus the cross-section is almost uniform 
in @. But as k gets larger, we start seeing the Gaussian shape, and the differential cross-section is largest for small 0, 


as shown below: 


do 
dQ 
a) 
q= .3sins 
g = 12sin 
b+ 


Example 199 


Finally, we'll consider a finite spherical square well, where V(r) = Vo for r < b and O otherwise. 


Plugging things in and doing the integral, since V(r’) is constant on the nonzero part, we're basically integrating 


rsin(qr) across a finite range, and it turns out that 


2mVo,. 
ae (sin(qb) — qbcos(qb)), 


F(@) =— 


and we can rewrite this in a similar form as above to get a dimensionless function of q: 


do mVob2\* (sin(qb) — qbcos(qb))? 
mn (Te) 


—_— = 4p" 
h? (qb)° 


We now get oscillatory behavior in the differential cross-section, and there are points where a = 0. So if we vary 


our energy and look at a fixed angle, we'll see that there will be total transmission at certain values of k and thus 
no scattering into this particular angle! And that’s something that is indeed experimentally observed as well (the 


Ramsauer- Townsend effect). 
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48 May 18, 2021 


Last time, we discussed the Born approximation, which is a way to do perturbation theory (in the strength of the 
potential) for scattering. Basically, after defining the rescaled potential U and Green's function G;, we found that we 


can write a solution for the wavefunction with appropriate boundary conditions as an infinite series 
(7) = era | PRG F- A)UA)emE 4... , 


where the Ath order term has k integrals, k copies of the Green's function, and k copies of the rescaled potentials. 


Truncating at this first order approximation gives us an amplitude asymptotically as r — oo that looks like 
fe(8.8) =— 7 f Pre UG) 
; ce : 


where k is the vector difference between the initial and final momentum. (So we have a Fourier transform of the 
potential at a particular wavevector k.) 

If we think about what's going on pictorally here, first note that the zeroth order term eK? is the incident plane 
wave (evaluated at some position 7), and the first order term is an integral over the range of the potential. Over this 
range, we can imagine the plane wave elkvFi hitting a point 4, and from there we multiply it with the value of the 
potential U(F,) and also the value of the Green's function (also known as a propagator) G.(7— 7). In other words, 
we can think of a particular point ~ as corresponding to a particle scattering off the particular location 7, into /. 

And with this type of analysis in mind, notice that we can think of the second order term (with two integrals) as 
hitting the potential at a point %, gaining a factor of U(r), and then rescattering to another point in the potential 
through the propagator G.(7 — 7%). We then gain another factor of U(7,) and then rescatter to our final destination 
with the help of G;(7— 7%). So the point is that the kth order term can be geometrically thought of as scattering 
off of k different points in the potential before reaching the final destination! And this is actually getting us into the 


beginnings of Feynman diagrams, though we won’t keep going in that direction. 


Example 200 


We can now aim to be more quantitative with our study, understanding when the Born approximation is valid. 


First of all, we expect that the first order term should be much smaller than the zeroth order term, so we will 
compare their magnitudes. Denoting Yo =e" and wi = f d?AG,(F— A)U(A)e*, we want || < [ol = 1. 
Since w, = (6, oe as r — oo, the term in the denominator shows that || < 1 in that regime, and the dangerous 
regime is the one where 7 is inside the potential. 

Suppose that the characteristic length scale of the potential is a, and suppose that the typical magnitude of the 


potential over this range is |V|. Then (writing out the Green's function) 


1 3. eka ne 
= d°r,—— U(f, )e’""". 
v1 z/ 1Foa (A) 
Since this integral is over the range of the potential, we can estimate this as 


1 
~ a-=|UI. 
a 


(we might worry that there might be divergence as fF, approaches /, but the volume element cancels that out). So our 
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condition is that 


2m|V|a? i 
—5 <1 = | VIS et 
ma 


This is the weak-potential condition, and we may remember it from the study of attractive potentials: if we have 
an attractive potential with depth |V| and width a, then this condition is the condition (in three dimensions) that 
guarantees that we have no bound states. 

But we know that the Born approximation is supposed to tell us about scattering states, so we need to look more 
carefully to understand why bound states are relevant. Basically, at V = 0, we only see scattering states, but once we 
get to a deep enough well, we suddenly see a bound state appearing! So this is non-analytic behavior, and in particular 
it will not be picked up by perturbation theory. So we need to make sure bound states do not appear for the Born 
approximation to make sense. (And to get this to apply for repulsive potentials, the argument is that the radius of 
convergence is the same whether we have negative or positive coefficients. ) 

But there’s another situation where we can make more flexible approximations — when the particle has high energy, 
it is not affected very much by the potential, so the effect is also weak. If we consider the limit where ka > 1, then 


we can instead have 


fi 
VIi« ran Ka) 


(so the factor of ka allows for our potential to be larger). The reason for this is that the phase term e’*!’—“I oscillates 
quickly when k Is sufficiently large, and that washes out most of the contributions of the integral. 

This concludes our discussion of scattering, and we'll now move to the final topic of the course, which is identical 
particles. This is a particularly important topic in lots of fields of modern physics, and it comes up whenever our 


system has more than one particle. 


Definition 201 
Two particles are identical if there are no experiments that can be done to distinguish their intrinsic properties, 


like spin, charge, and magnetic moment. 


Example 202 


As far as we know, all electrons are identical, and so are protons and neutrons (even though the latter two are 


not elementary particles). Similarly, hydrogen atoms are identical (even though they can be in different states at 


a particular time, those states can be realized by both atoms). But a proton and neutron are not identical. 


In classical mechanics, if we have multiple electrons, we will often label them and call one “electron 1” and another 
“electron 2,” and we can then write down a classical Hamiltonian which satisfies H(7, 01, %, P2) = H(f, bo, A, pr). 
And if we shoot two electrons at each other and they scatter off of each other, their trajectories will be defined at 
every point in time, so we can track which electron is which. 

But in quantum mechanics, if we imagine scattering two indistinguishable particles at each other and detecting 
particles at two outgoing detectors, we cannot answer the question of “which particle hit which detector,” because 
there is no well-defined notion of a trajectory (we can't measure their positions during the scattering because that 
would collapse the wavefunction). So we need to be more explicit about how to define quantum states of several 
particles. 

We know that if our particles are distinguishable, we can describe them using tensor product spaces: letting Vy 


be the Hilbert space of particle 1 and V be the Hilbert space of particle 2, the total Hilbert space will be the tensor 
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product V = 4 @ Vs. In particular, if Yi is spanned by {|u;)} and Vs is spanned by {|v;)}, then V will be spanned by 
the vectors {|u;) ® |vj)}. (We sometimes put subscripts 1 and 2 on |u;) and |v), respectively, to indicate that they 
correspond to the first and second particles.) 

But suppose we have two indistinguishable particles — for simplicity, suppose we have two electrons, and we'll just 
focus on the spin states here for simplicity. Then each particle can be in the |+, Zz) or |—, z) state, and together the 
two electrons can either be in the |+), ®|+).5 state (both up), |—), ®|—)» state both down), or in a state where one 


is up and one is down. 


For this last state, it seems that we can either use |+), ® |—)» or |—), ®|+)5, but the two particles are identical 
(so making a choice of one or the other is odd). It makes sense to try to identify the two states, but we run into 


trouble if we try to do that — instead, the answer is that we must use symmetric and antisymmetric states 


IS) = Za @| ytise. A) = (4 8 | pee: 


It turns out that if we have bosons, then |S) is the correct combination to use, and if we have fermions, then |A) is 
the correct one. 

More generally, let’s say that we have two identical particles, and for each particle, we have the Hilbert space V 
spanned by an orthonormal basis {|/)}. The two-particle Hilbert space then lives in V @ V, but we have to put further 


constraints when we have identical particles. 


Definition 203 


The permutation operator P,, is defined by its action on basis states of V @ V: 


Pr2 |i), @ U)2 = UV). @ Io - 


Proposition 204 
For bosons, our wavefunction |) for the two identical particles must satisfy Pi. |w) = |), and for fermions, our 


wavefunction must satisfy Pi |w) = — |). 


If we denote the spaces satisfying those conditions as the symmetric subspace Vs and the antisymmetric subspace 
Va, It turns out that V@V = Vs BVy for two particles, but this will not hold more generally for more than two particles. 
The space Vs (for bosons) is spanned by the states |/), © /)> + /); @ |/)5, and normalizing for different values of /, / 


gives us 
1 


V2 


On the other hand, the space V4 (for fermions) is spanned by 


1), @lio. (1), @ Wot), @li), i#i. 
a 2 ee , 
alls @ 2 — Ur @ la). iA, 


this time noting that we can’t set / = / because that gives us a state of zero. So this leads us to the Pauli exclusion 
principle: two fermions cannot be in the same state because of this permutation operator. 


If we now return to the case of two spin 1/2 particles, we have a basis (dropping tensor product symbols) 


{li 4). 3 | 1, es 2} 
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for bosons, and a basis of just one element 


for fermions. (So if we have two electrons with the same spatial wavefunction, they can’t both be spin up.) 


Fact 205 


In nonrelativistic physics, there is no indication of which particles should be bosons or fermions. But quantum field 


theory gives us the spin statistics theorem in three spatial dimensions, which tells us that particles with integer 


spin (like pions and the W and Z bosons) are bosons, and particles with half-integer spins (like electrons, protons, 


and neutrons) are fermions. 


In particular, notice that a hydrogen atom is a boson because it has a proton and an electron (so integer total 
spin), and some isotopes of helium are bosons while others are fermions because of the number of neutrons. 
Suppose now that we have WN particles, so that our wavefunctions live in the tensor product space V @V @---@V. 
We can then label our states as 
li1)1 le)o-+- lin) vs 


and again we have an additional constraint on our states: 


Definition 206 


The transposition operator Pygmy is defined by its action on basis states: 


Poem) |) [l2)o° ++ Vlede >> Vin) m= Viv) = Vt) Veda => Vine Ved Ln - 


Much like before, we now require 


Pom) |¥) = =|) — for all mA £, 


where we require the positive sign for bosons and the negative sign for fermions, and we similarly define the symmetric 
and antisymmetric subspaces here. And this time, we build a basis for these subspaces by thinking about the structure 


of permutations. 


Definition 207 


A permutation a € S,, is a bijection a: {1,2,--- ,n} > {1,2,--- , n}, meaning that a(/) 4 a) for i # /. 


For example, there are 6 different permutations for n = 3, which we can represent as 123 (identity), 213 (swapping 
1 and 2), 132 (swapping 2 and 3), 321 (swapping 1 and 3), and 231 and 312 (cyclic shifts). More generally, there are 
n! permutations for permutations on n elements. The idea is that we can build a bosonic state by symmetrizing one 


of our initial unconstrained states: if we have N particles, then 


LP) = D5 lheway) |e(2y)o°°* exc) wy 


acSy 


We'll discuss this more next time! 
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49 May 19, 2021 (Recitation) 


We'll discuss identical particles today: the idea is that indistinguishable particles (such as two electrons) behave in a 
way so that swapping them keeps probabilities the same: letting a1, @2 be quantum numbers of the “first” and “second” 
particle, 

dV dVa|p (a1, a2)? = AY dV4|b(ar2, a1)’, 


which means that (a1, a2) and (a2, a) can only differ by a phase e®. But we can swap twice and get back to 
where we started, so that 
W(a1, a2) = e?p(a2, a1) = e?°W(ar, a2), 


which can only happen if our original phase e’ is either 1 or —1. We can formalize this by using the (Hermitian) 
exchange operator Pio, taking W(a1,a2) to W(a2,a1): then we have P2, = |, and [Pi2,H] = 0 (because the 
Hamiltonian doesn't know which particle is which, meaning it is unchanged by exchange). 

So we can think about the simultaneous eigenstates of P;2 and H, and things are a little deeper in that eigenstates 
of Piz with eigenvalue 1 are bosons (giving us a wavefunction symmetric under interchange), and eigenstates with 
eigenvalue —1 are fermions (giving us a wavefunction antisymmetric under interchange). And it turns out that bosons 
have integer spin, and fermions have half-integer spin — this is something we can prove using relativistic quantum field 
theory. 

If we use the notation |a, 8) = |a~), ® |B), where we have two identical particles indexed by 1 and 2, then we can 
form the symmetric and antisymmetric wavefunctions 

1 1 
v2 v2 


for all a 4 6. And when a = B, we have the symmetric wavefunction |@, a), but there is no corresponding antisym- 


(lo, 6B) + |6, a), (lor, 8) — |B, @)) 


metric wavefunction: as we mentioned in lecture, we can’t have two identical fermions in the same state, and this is 


the Pauli exclusion principle. 


Example 208 


Suppose we have a one-dimensional harmonic oscillator potential, and we have five electrons inside this potential 


(but with no Coulomb interactions between them). 


We know that electrons are spin 1/2 fermions, and so it makes sense to ask about questions like the energy and 


spin of the ground state and excited states and how the wavefunctions look. First of all, the Hamiltonian looks like 


where Hj is the harmonic oscillator Hamiltonian for the ith particle (assuming no spin-dependence), and we can label 


our states as |, M2, 23, M4, M5) where n; > 0. Then the energy is 


5 
E= (m+ m+ mtn tng + 5) hu 


If we want to find the ground state energy, then we want to minimize the njs, but we can’t throw all of the electrons 
into the ground state because of the Pauli exclusion principle. Instead, we can fill the ground state with a spin-up and 
a spin-down electron, then fill the first excited state with a spin-up and a spin-down electron, and finally fill the second 


excited state with the final electron (so that we have (1, M2, 73, 14,5) = (0,0,1,1,2)). This gives us an energy of 
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eS (4+ 2) fw = Shu, and there is a two-fold degeneracy of the ground state because the final electron can be 


spin-up or spin-down. 


Remark 209. Note that just writing down the Hamiltonian H above does not uniquely determine our system — we 
wouldn't need the same constraints if we had bosons instead of fermions. So we need to specify what is in our 


Hamiltonian too! 
If we now think about the total spin of the ground state, notice that the first and second particles have identical 
(symmetric) spatial wavefunctions, so the spin wavefunction must be antisymmetric: this gives us an expression 
1 
V2 


meaning that we get a total spin-O state. Similarly, the third and fourth particles must be in a spin-O state that looks 


(IN) — Wt), 


as above, because of symmetric spatial wavefunctions. The final (outer) electron then must have S = 5, and thus the 
ground state is in a state of total spin S = 5. 

From here, we can look at the first excited state: we can either raise ns from 2 to 3, or we can raise either ng or 
ng from 1 to 2, and in either case we get energy P hw. This gives us two possible spin states for (0,0,1,1,3) and 
two possible spin states for (0,0, 1,2, 2), so there is a four-fold degeneracy at the first excited level. 

But one thing we haven't looked at closely yet is the fact that the wavefunction must be totally antisymmetric: 
it must look the same under interchange of any two of the particles, not just those in the same energy level. And the 


way that we construct this wavefunction is to write as a determinant of the following form: 


V5(x1) Wea) ViOa) VI) PCa) 

; Wh(%2) Wel%2) Vie) Vibe) P§x) 
W= Fai det |Yols) Vole) ibs) ViGe) ¥30s)} . 
Wh (xa) Wolxa) W(x) Wy(xa) WE (xa) 


Wh(xs) Wyse) Wi(x%s) Vi(xs) WSs) 


where s is the spin of our particle with n = 2 (either up or down, so there are really two different wavefunctions of 
this form). Indeed, terms of the determinant always include only one term in each row, and indeed if we swap any two 
rows we get a negative sign (which is how we enforce the antisymmetry that we want). (This is known as a Slater 
determinant, and we'll talk more about it in lecture.) 


We'll now discuss the exchange force, explaining how the antisymmetry can be physically enforced. 


Example 210 
Suppose we have an electron in Boston and an electron in Los Angeles: we want to know if we are required to 


think of the two electrons as indistinguishable, so that we need to antisymmetrize the wavefunction, and we want 


to understand what that antisymmetrization does. 


If we started with two distinguishable particles, our wavefunction for these two particles would look like 


Wax, X2) = Wal) Yr (2), 


where Wr is localized “here” and wr Is localized “there.” If our particles become indistinguishable, we then have to 


look at wavefunctions of the form 


Ss Wl Wr (2) Va (2)¥r (a). 


W+ (x1 , x2) = 
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We'll see what this (anti-)symmetrization does by calculating the expectation value of (% — X2)* in both cases. In the 


distinguishable case, we have 
(al (1 — %)?| a) = : x1 dx2|ha(x1, X2)|?(x1 — x2), 


and expanding out the square and noting that wg is a product wavefunction gives us 


= (/ ahs x2 bua)? (/ chal?) =f (/ cham u(oa)) (/ craralrl?) +( f arabs) (/ cro lbr?) 


and this simplifies to 
= (x7) + OX?) 7 — (x) u(x). 


But now if we instead use the ws states and find the expectation value, we find that (through a very similar calculation 


for each of the two terms) 


(ba| (Ri — &)? |W) = (x?) + 0X?) 7 — 2x) ax) 7 F 2 (Waller) (rl x|pa) - 


In other words, we end up with an extra term which we can think of as an off-diagonal term 2|(x)47|* of the matrix, 


and it’s what we call the exchange force. This tells us then that the symmetric version w, has particles that tend to 
be closer together than distinguishable, and the antisymmetric version w_ has particles that tend to be farther apart. 

But if we imagine our electrons that are in Boston and Los Angeles, respectively, the contribution of this exchange 
force will be very small: 


(Walxibr) = / dxibe(xdxbr(X), 


and if the electron wavefunctions are localized far apart, the effect will be negligibly small. 


We'll close with a few facts that are useful to keep in mind: 


Fact 211 
Goldhaber's experiment tried to find out whether G-particles (coming from @-decay) are actually electrons. He 
did this by having particles bounce off of lead and looking at whether there were outgoing electrons that came 


from particles that went into the ground state. 


Fact 212 


The At* particle has spin 3 and is formed by three up-quarks (each spin 1/2 fermions). Then the ground state 


of At* is spin-symmetric and also spatially-symmetric, but we know that A** must be anti-symmetrizable. And 


that’s what lead to the discovery of a new quantum number, which we call color. 


Fact 213 

The hydrogen molecule’s rotation states can be classified by the Y¢ mS of the particles and also by the spin state. If 
we imagine a rotational mode of this hydrogen molecule and we attempt to do the antisymmetrization procedure, 
it turns out that for even & we have a spatially even wavefunction and S = 0, and for odd 2 we have a spatially 
odd wavefunction and S = 1. So there are two types of (stable) hydrogen gas with different spins, and this is 


what we call “ortho hydrogen” and “para hydrogen.” 
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50 May 20, 2021 


We started talking about identical particles last time, thinking about the wavefunction when we have NW such particles. 
The setup is to write the Hilbert space of a single particle V as being spanned by some orthonormal basis |/), so that 
the N-particle Hilbert space is V @V @--- @V. If we then define the transposition operator Prem) which swaps the 
states for the éth and mth particle, we require that our wavefunction is an eigenstate of P(gm) for all £ A m, with 
eigenvalue 1 for bosons and —1 for fermions. (So the wavefunction |) lives in the symmetric subspace for bosons 
and antisymmetric subspace for fermions.) 

At the end of the last lecture, we noted that if we want to construct a symmetric version of a state |/1), |/2)5--- lin) y 


we can consider all permutations of {1,--- , N} (there are N! of them). The bosonic state can then be written as 
Pde = Do Viecay)s Ve2r)o°** Vet) 
acSy 


Indeed, summing over all permutations means that we have Pygmy |W) g = |) g (because the set of permutations forms 


a group). 


Example 214 


With N = 3, there are 6 permutations. If we start with a state |/1), |/2)> |/g)3, then we have the bosonic state 


Ith) = Wi)1 [2)2 [ada + [2)1 |4)2 ba) + [a)s [3)2 [ada + [3)1 2) 2 a) + |e) V3) 2 La) + [3)1 |4)2 2) - 


Note that the normalization factor can be slightly different depending on the values of (1, io, i3: if they are all 


distinct, then all N! terms here are orthogonal, and thus our normalization factor is Sar But things can look a little 
different if some of those state labels are the same. 
But for fermions, we know that we want a negative sign coming out of transposition, so it turns out that basis 


states for the antisymmetric subspace will take a similar form but with some additional factors of —1. 


Definition 215 
Let a € Sy be a permutation. The sign of a, denoted sgn(q), is defined as follows: if we can write a as a product 


(composition) of k transpositions, then it turns out k’s parity is well-defined. We then let sgn(a) = (—1)*. 


Example 216 
For the case N = 3, the permutation 123 (identity) has sign 1, the permutations 213, 132, 321 (one transposition) 
have sign —1, and the permutations 231, 312 (cyclic) can be written as the product of transpositions (12)(23) 


and (23)(12), respectively, so they have sign 1. 


It’s important to note that we evaluate something like (12)(23) from right to left: for example, we can check that 
2 gets sent to 3, because the transposition (23) sends 2 to 3 and then (12) keeps 3 where it is. 


We can now define the fermionic state to be 


LY) = D5 sam(e) Jina), |iat2y)o°** [fac )y « 


aeSy 


Indeed, acting by Pygmy |W) = will indeed give us an additional minus sign in |) ,-, because we're essentially “adding one 


transposition” to each of our permutations. For the N = 3 case we've been explicitly analyzing, the fermionic state 
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will look like 


IW) 3 = |) |2)0 |’3)3 — Ve) |) 113) 3 — |) 2 143) 2 Ia) 3 — Va) Ve) I) 3 + |) [3)o Ia) 3 + lis) 1 |) 2 |2)3 - 


And in this case, we cannot have two of the state labels equal at all: if we let 4 = jo, then pairs of terms will cancel 
out with each other (specifically those which only differ by the transposition P12)), and we're left with the state 0, 
which is not valid. So the Pauli exclusion principle does show up as we expect, and this means that we will always have 
a normalization factor of Tl because the N! terms here are all orthonormal. 

So far, we've been dealing only with countable bases, but we may also want to ask about situations like position- 
space wavefunctions with an uncountable basis. Our Hilbert space for single-particle wavefunctions is then V = L?(R3), 


and we can write our single-particle wavefunctions as w(7), so that our N-particle wavefunctions look like 
W(TL, Fa,+++ FN). 


The constraint that we need to put on our wavefunction is then 


Wha ac Tint In) = EWA in a Pn) 


with + depending on whether we have bosons or fermions. This time, if we work in a basis ¢m(7) where m labels our 


states, then we can replace our |m1), |/M2)5---|My)y notation with 
WA Tn) = Om (A) Om (2) +++ Onn (Fv): 
And similarly to before, bosonic wavefunctions will be 


pelfA, et Ty) = S- Praca) (7. ) Praca (72) :. Pian (IN) 
acSy 
(We can also check that we can equivalently permute the 7s instead of the labels m;, since we’re summing over all 


permutations.) Our fermionic states again also just give us some additional factors of —1: 


We(Ases sFv) = D7 san(e)bmgc (Fi) Pac (2) ++ Pmacy Fav): 
acSy 
We can write this last wavefunction in a more compact manner as well: if we define an Nx N matrix A via Ajj = bm, (7), 


meaning that 


dm(A) Om) -++ Om (TN) 
Ae dm (Ai) Om (2) -*+ Om (Tv) 
Pmy (1) Pmy (12) aaa Pmy (Tv) 
then the fermionic wavefunction wer is actually just the determinant of A (because the formula for the determinant 
also has us summing over permutations Ajj); with an additional sgn(q@) factor in front). This is known as a Slater 
determinant (as mentioned in recitation), and it turns out that the permanent of A (which is like the determinant 
but without the sign factors) gives us the bosonic wavefunction as well. 


More generally, particles can have both a position and a spin, and thus the Hilbert space for a single particle is 
V= L?(R°) Q C2stt 


where C?5+! is the span of the (25+ 1) possible states for a spin s particle, {|o) :o = —s,-s+1,---,s—1s}. Our 
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single-particle wavefunction then generally looks like 
|) particle = S- Wo(F) |o) 
oO 


(For example, for a spin 1/2 particle, our system can be in the up state or the down state, so |W) = W(F,+)|+) + 


w(r,—)|—).) Therefore, states for the N-particle wavefunction can be written 


Wu = S> Wi, 01; 7, 025+++ 5 Fv, On) |O1)1 |02)9°- |ON) py - 


We now have symmetry or antisymmetry conditions, and the idea is that we must have 


WU Oise I Ob? ie Tag The Op) = SW O15 > Ph ins FT GE TN Ga) 


for the position part of the wavefunction. But because we have both a spin and a position part, we can’t immediately 


say something about the + sign here. 


Example 217 


We'll look at the special case where we have two particles of spin 1/2. 


In this situation, we can always decompose the spin wavefunction into symmetric states of the form 
I+)1 1+). 


I) = 04(A.)) 4 Se(14)1 D2 + +): H)2 
I-)1I-)s 


or the antisymmetric state 


WV) = 6-H Se. 


So now if we have fermions, then the combined wavefunction must be antisymmetric. Therefore, 64(/%, 7%) must be 
antisymmetric, while ¢_ (71, %) must be symmetric, because the corresponding spin parts are symmetric and antisym- 
metric, respectively. And if we have bosons, we must have ,, @_ being symmetric and antisymmetric, respectively. 
We'll now discuss a more economical way of representing these symmetric and antisymmetric states using oc- 
cupation numbers. The idea is that if we have bosons and start in a state |/1), |fo)>---|/w)q, then |), and |p), 
essentially sum over permutations of those labels, and the same number of /s will be present in each of those terms. 
So we can just specify how many |1)s, |2)s, |3)s, and other states show up in our original |/,), |fo)>--- |i) yi letting n; 


be the number of particles in single-particle state /, we get the representation in the number basis 


|p) =|m,M2,M3,-°-), 


where nj € Zso for bosons and nj € {0,1} for fermions. 


Example 218 


Consider a Hilbert space for a single particle spanned by |++) and |—) (ignore spatial components, and pretend that 


this is valid for both bosons and fermions). 


We described above that there are three symmetric basis wavefunctions for bosons, and they correspond to occu- 
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pation numbers via 


4), +2 ae ee 
gilt) l-)at+l-)ilt)2 = lap = 10 =1), 
lee elas = |n, =0,n_=2). 


On the other hand, the only antisymmetric basis state is 
i 
v2 


(Note that the number basis is only defined within either the symmetric or antisymmetric basis, so there is no ambiguity 


(1 -ya = |-)a fe) = te = Le = 1). 


between the two |n, = 1, n_ = 1)s since we need to specify our subspace first.) This kind of notation is particularly 
useful when we have many particles, and it’s cleaner conceptually because we know that the total sum of the numbers 


should always be WN. 


Remark 219. Note that |n+ = 1,n_ = 1) can represent either (| +), |=)o—|-)4 |+)5) or (| 1 +)2—-|+)1 |-)o) 
— this gives us an overall phase factor, and we need a convention for it. What we usually do Is to order our single-particle 


states and choose the sign so that we have a coefficient of 1 in front of the one where iy < ip < +++ < Ip. 


We can now turn to a discussion of operators acting on these Hilbert spaces and how obserables behave. Because 
our setup can’t distinguish between particles, the observables also shouldn't be able to do so, meaning that any 


observable A must satisfy [A, Piem)] = 0. In particular, we must have [H, Pim] = 0. 


Example 220 


Suppose that we have a Hamiltonian of the form H = cae bes + U(A) + U(%) + V(A — %). Then we have 


2m 


[H, Pi2] = 0 as long as V(r) = V(-P). 


NB; 
i=1 2m 
an atom or a solid), the easiest situation for us is when the particles do not interact at all, meaning that we can write 


N 
H= omg 
i=1 


More generally, if we have H = >> + yan U(Fi) + i<j Vi — FF) (which describes a system of electrons in 


where Hi) =1@1®---@ Hg ®---@! only acts on the /th particle. Then if the spectrum of Hsp can be described as 
Hep |m) =Em|m), me Zy,, 
then eigenstates of the multiparticle Hamiltonian will look like 
Ib) = |7m)i | M2)9°°- |v) EB = Em + + Em. 


If we now consider this state along with all of its permutations, all of those states will be eigenstates with the same 
energy, and thus the superposition will also be a superposition with said energy E. So this is how we can find bosonic 
and fermionic states for a non-interacting H: we symmetrize or antisymmetrize accordingly as we've already done. 
And just like before, we can use the occupation number representation using the single-particle eigenstates as basis: if 


it looks like |n1, M2, 3,---), then the energy of our eigenstate is 


EC ieie c= ) E;1n;. 
F 
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Finding the ground states of this system now depends on whether we have bosons or fermions: if we have bosons, we 
can Just put all N of the particles into the ground state, and our energy is just N times the energy of the single-particle 
ground state (and this leads us to Bose-Einstein condensation). But if we have fermions, we can only have one particle 
per state, and thus the ground state in the number basis is |1,1,1,1,--- ,1,0,0,---). 

And we can use this to explain the structure of atoms — ignoring the Coulomb repulsion between electrons and 
thinking of them as N independent particles, each electron has the Coulomb energy levels E,, = —Z? BSN meaning 
that they can be in any of the 1s, 2s, 2p,--- states. But then we fill in electrons from lowest to highest energy: each 
of these levels can be doubly degenerate because the electron can have spin up or down. That's why the first shell has 
2 electrons, the second has 2+ 6 = 8 electrons, and so on! (The Coulomb repulsion will modify this, but the main 


features still remain.) 
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